Polyketides and their synthesis 



Field of Invention 

The present invention relates to processes and materials (including recombinant 
strains) for the preparation and isolation of macrolide compounds, particularly compounds 
differing from natural compounds at least in terms of glycosylation. It is particularly 
concerned with erythromycin and azithromycin analogues wherein the natural sugar at the 5- 
position has been replaced. The invention includes the use of recombinant cells in which 
gene cassettes are expressed to generate novel macrolide antibiotics. 

Background to the Invention 

The biosynthetic pathways to the macrolide antibiotics produced by actinomycete 
bacteria generally involve the assembly of an aglycone structure, followed by specific 
modifications which may include any or all of: hydroxylation or other oxidative steps, 
methylation and glycosylation. In the case of the 14-membered macrolide erythromycin A 
these modifications consist of the specific hydroxylation of 6-deoxyerythronolide B to 
erythronolide B which is catalysed by EryF, followed by the sequential attachment of 
mycarose via the hydroxyl group at C-3 catalysed by the mycarosyltransferase EryBV 
(Staunton and Wilkinson, 1997). The attachment of desosamine via the hydroxyl group at C-5, 
catalysed by EryCIII, then results in the production of erythromycin D, the first 
intermediate with antibiotic activity. Erythromycin D is subsequently converted to 
erythromycin A by hydroxylation at C-12 (EryK) and O-methylation (EryG) on the 
mycarosyl group, this order being preferred (Staunton and Wilkinson, 1997). The 
biosynthesis of dTDP-L-mycarose and dTDP-D-desosamine has been studied in detail 
(Gaisser et al., 1997; Summers et al., 1997; Gaisser et al., 1998; Salah-Bey et al., 1998). 

Recently, 3.1 Å high-resolution X-ray investigation of the interaction of ribosomes 
with macrolides (Schlünzen et al., 2001; Hansen et al., 2002) has revealed key interactions 
giving direct insights into ways in which macrolide templates might be adapted, by chemical 
or biological approaches, for increased ribosomal binding and inhibition and for improved 
effectiveness against resistant organisms. In particular, previous indications about the 
importance of the sugar substituent at the C-5 hydroxyl of the macrocycle for ribosomal 
binding are fully borne out by the structural analysis; this substituent extends towards the 
peptidyl transferase centre and, in the case of 16-membered macrolides, which bear a 
disaccharide at C-5, reaches further into the peptidyl transferase centre, thus providing a 
molecular basis for the observation that 16-membered macrolides inhibit ribosomal capacity 
to form even a single peptide bond (Poulsen et al., 2000). This suggests that erythromycins 
with alternative substituents at the C-5 position, for example mycaminosyl and 
angolosaminyl erythromycins, and in particular mycaminosyl and 4'-O-substituted 
mycaminosyl erythromycins, are highly desirable as potential anti-bacterial agents. 

Since post-polyketide synthase modifications are often critical for biological activity 
(Liu and Thorson, 1994; Kaneko et al., 2000), there has been increasing interest in 
understanding the mechanism and specificity of the enzymes involved, in order to engineer the 
biosynthesis of diverse novel hybrid macrolides with potentially improved activities. Recent 
work has demonstrated that the manipulation of sugar biosynthetic genes is a powerful 
approach to isolate novel macrolide antibiotics. The recently demonstrated relaxed specificity 
of the glycosyltransferases is crucial for this approach (see Mendez and Salas, 2001 and 
references therein). In the pathways to erythromycin A and methymycin / neomethymycin, 
the production of hybrid macrolides has been observed after inactivation of specific genes 
involved in the biosynthesis of deoxyhexoses (Gaisser et al., 1997; Summers et al., 1997; 
Gaisser et al., 1998; Salah-Bey et al., 1998; Zhao et al., 1998a; Zhao et al., 1998b) or after 
the expression of genes from different biosynthetic gene clusters (Zhao et al., 1999). A 
relaxed specificity towards the sugar substrate has also been reported for glycosyltransferases 
that have been expressed in heterologous strains, including glycosyltransferases from the 
pathways to vancomycin (Solenberg et al., 1997), elloramycin (Wohlert et al., 1998), 
oleandomycin (Doumith et al., 1999; Gaisser et al., 2000), pikromycin (Tang and McDaniel, 
2001), epirubicin (Madduri et al., 1998), avermectin (Wohlert et al., 2001) and spinosyn 
(Gaisser et al., 2002a). Most of the successful alterations so far reported have involved 
relaxed specificity towards the activated sugar moiety, while as yet only isolated examples are 
known where a glycosyltransferase targets its deoxysugar to an alternative aglycone substrate 
(Spagnoli et al., 1983; Trefzer et al., 1999). Both WO 97/23630 and WO 99/05283 describe 
the production of erythromycins with an altered glycosylation pattern in culture supernatants 
by deletion of a specific sugar biosynthesis gene. Thus WO 99/05283 describes low but 
detectable levels of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D in the culture 
supernatant of an eryCIV knockout strain of S. erythraea. It has also been demonstrated that 
the use of the gene cassette technology described in patent WO 01/79520 is a powerful and 
potentially general approach to isolate novel macrolide antibiotics by expressing 
combinations of genes in mutant strains of S. erythraea (Gaisser et al., 2002b). WO 01/79520 
also describes the detection of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A in 
culture supernatants of the S. erythraea strains SGQ2pSGCIII and SGQ2p(mycaminose)CIII, 
fed with 3-O-mycarosyl erythronolide B. However, the low levels of 
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A make this a less than optimal method for 
producing this valuable material on large scales, and similar problems were encountered 
synthesizing 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A using chemical methods 
(Jones et al., 1969). EP 1024145 refers to the isolation of azithromycin analogues carrying a 
mycaminosyl residue such as 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin and 
3"-desmethyl-5-O-dedesosaminyl-5-O-mycaminosyl azithromycin. However, the only examples 
given in this area are "prophetic examples" and there is no evidence that they could actually 
be put into practice. 

Therefore, the present invention provides the first demonstration of an efficient and 
highly effective method for making significant quantities of erythromycins and azithromycins 
which have non-natural sugars at the C-5 position, in particular mycaminose and 
angolosamine. In a specific aspect the present invention provides for the synthesis of 
mycaminose and angolosamine using specific combinations of sugar biosynthetic genes in 
gene cassettes. 

Summary of the Invention 

The present invention relates to processes, and recombinant strains, for the 
preparation and isolation of erythromycins and azithromycins, which differ from the 
corresponding naturally occurring compound in the glycosylation of the C-5 position. In 
particular, the present invention relates to processes and recombinant strains for the 
preparation and isolation of 5-O-dedesosaminyl-5-O-mycaminosyl or angolosaminyl 
erythromycins and azithromycins, in particular 5-O-dedesosaminyl-5-O-mycaminosyl 
erythromycins and 5-O-dedesosaminyl-5-O-mycaminosyl azithromycins, and specifically 
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B, 5-O-dedesosaminyl-5-O-mycaminosyl 
erythromycin C, 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D, 
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A, and 5-O-dedesosaminyl-5-O-mycaminosyl 
azithromycin. 
The present invention further relates to novel 5-O-dedesosaminyl-5-O-mycaminosyl, 
angolosaminyl erythromycins and azithromycins produced thereby. 

Detailed description of the Invention 

The present invention relates to processes, and recombinant strains, for the 
preparation and isolation of erythromycins and azithromycins which differ from the naturally 
occurring compound in the glycosylation of the C-5 position. These are referred to herein as 
"compounds of the invention" and unless the context dictates otherwise, such a reference 
includes a reference to 5-O-dedesosaminyl-5-O-mycaminosyl erythromycins, 
5-O-dedesosaminyl-5-O-angolosaminyl erythromycins, 5-O-dedesosaminyl-5-O-mycaminosyl 
azithromycins, and 5-O-dedesosaminyl-5-O-angolosaminyl azithromycins, specifically 
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A, 5-O-dedesosaminyl-5-O-mycaminosyl 
erythromycin C, 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B, 
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D, 5-O-dedesosaminyl-5-O-mycaminosyl 
azithromycin, 5-O-dedesosaminyl-5-O-angolosaminyl erythromycin A, 
5-O-dedesosaminyl-5-O-angolosaminyl erythromycin B, 5-O-dedesosaminyl-5-O-angolosaminyl 
erythromycin C, 5-O-dedesosaminyl-5-O-angolosaminyl erythromycin D, 
5-O-dedesosaminyl-5-O-angolosaminyl azithromycin and analogues thereof which additionally 
vary in glycosylation at the C3 position (see WO 01/79520) and which may also vary in the 
aglycone backbones (see WO 98/01571, EP 1024145, WO 93/13663, WO 98/49315). The invention 
relates to processes, and recombinant strains, for the preparation and isolation of compounds 
of the invention. The present invention further relates to novel 
5-O-dedesosaminyl-5-O-angolosaminyl erythromycins and azithromycins produced thereby 
(Figure 1). The methodology comprises in part the expression of a gene cassette in the 
S. erythraea mutant strain SGQ2 (which carries genomic deletions in eryA, eryCIII, eryBV and 
eryCIV (WO 01/79520)), as described in Examples 3 and 6, and in S. erythraea Q42/1 
(BIOT-2166) (Examples 1-4) and S. erythraea 18A1 (BIOT-2634) (Example 6). Detailed 
descriptions are given in Examples 1-11. 

The invention relates to a process involving the transformation of an actinomycete 
strain, including but not limited to strains of S. erythraea such as SGQ2 (see WO 01/79520) 
or Q42/1 or 18A1 (whose preparation is described below), with an expression plasmid 
containing a combination of genes which are able to direct the biosynthesis of a sugar moiety 
and direct its subsequent transfer to an aglycone or pseudoaglycone. 

In a particular embodiment the present invention relates to a gene cassette containing 
a combination of genes which are able to direct the synthesis of mycaminose in an appropriate 
strain background. The gene cassette may include genes selected from but not limited to 
angorf14, tylMIII, tylMI, tylB, tylAI, tylAII, tylIa, angAI, angAII, angMIII, angB, angMI, 
eryG, eryK and glycosyltransferase genes including but not limited to tylMII, angMII, desVII, 
eryCIII, eryBV, spnP, and midI. In a preferred embodiment the gene cassette comprises 
angorf14 in combination with one or more other genes which are able to direct the synthesis 
of mycaminose. In a more preferred embodiment the gene cassette comprises angAI, 
angAII, angorf14, angMIII, angB, angMI, in combination with one or more 
glycosyltransferases such as but not limited to eryCIII, tylMII, angMII. In an alternative 
embodiment the gene cassette comprises tylAI, tylAII, tylMIII, tylB, tylIa, tylMI in 
combination with glycosyltransferases such as but not limited to eryCIII, tylMII and angMII. 
In a preferred embodiment the strain is an S. erythraea strain. 

In a particular embodiment the present invention relates to a gene cassette containing 
combinations of genes which are able to direct the synthesis of angolosamine, including but 
not limited to angMIII, angM, angB, angAI, angAII, angorf14, angorf4, tylMIII, tylMI, tylB, 
tylAI, tylAII, eryCVI, spnO, eryBVI, and errand one or more glycosyltransferase genes 
including but not limited to eryCIII, tylMII, angMII, desVII, eryBV, spnP and midI. In a 
preferred embodiment the gene cassette contains angMIII, angMI, angB, angAI, angAII, 
angorf14, spnO in combination with a glycosyltransferase gene such as but not limited to 
angMII, tylMI or eryCIII. In a preferred embodiment the strain is an S. erythraea strain. 

In one embodiment, the process of the present invention further involves feeding of 
an aglycone and/or a pseudoaglycone substrate (for definition see below), including (but not 
limited to) 3-O-mycarosyl erythronolide B, erythronolide B, 6-deoxy erythronolide B, 
3-O-mycarosyl-6-deoxy erythronolide B, tylactone, spinosyn pseudoaglycone, 3-O-rhamnosyl 
erythronolide B, 3-O-rhamnosyl-6-deoxy erythronolide B to cultures of the transformed 
actinomycete strains, the bioconversion of the substrate to compounds of the invention and 
optionally the isolation of said compounds. This process is exemplified in Examples 1-11. 
However, a person of skill in the art will appreciate that in an alternative embodiment the host 
cell can express the desired aglycone template, either naturally or recombinantly. 

As used herein, the term "pseudoaglycone" refers to a partially glycosylated 
intermediate of a multiply-glycosylated product. 

Those skilled in the art will appreciate that alternative host strains can be used. A 
preferred cell is a prokaryote or a fungal cell or a mammalian cell. A particularly preferred 
host cell is a prokaryote, more preferably host cell strains such as actinomycetes, 
Pseudomonas, myxobacteria, and E. coli. It will be appreciated that if the host cell does not 
naturally produce erythromycin, or a closely related 14-membered macrolide, it may be 
necessary to introduce a gene conferring self-resistance to the macrolide product, such as 
ermE from S. erythraea. Even more preferably the host cell is an actinomycete, even more 
preferably strains that include but are not limited to S. erythraea, Streptomyces griseofuscus, 
Streptomyces cinnamonensis, Streptomyces albus, Streptomyces lividans, Streptomyces 
hygroscopicus sp., Streptomyces hygroscopicus var. ascomyceticus, Streptomyces 
longisporoflavus, Saccharopolyspora spinosa, Streptomyces tsukubaensis, Streptomyces 
coelicolor, Streptomyces fradiae, Streptomyces rimosus, Streptomyces avermitilis, 
Streptomyces eurythermus, Streptomyces venezuelae, Amycolatopsis mediterranei. In a more 
highly preferred embodiment the host cell is S. erythraea. 

It will readily occur to those skilled in the art that the substrate fed to the recombinant 
cultures of the invention need not be a natural intermediate in erythromycin biosynthesis. 
Thus, the substrate could be modified in the aglycone backbone (see Examples 8-11) or in the 
sugar attached at the 3-position or both. WO 01/79520 demonstrates that the desosaminyl 
transferase EryCIII exhibits relaxed specificity with respect to the pseudoaglycone substrate, 
converting 3-O-rhamnosyl erythronolides into the corresponding 3-O-rhamnosyl 
erythromycins. Appropriate modified substrates may also be produced by chemical 
semi-synthetic methods. Alternatively, methods of engineering the erythromycin-producing 
polyketide synthase, DEBS, to produce modified erythromycins are well known in the art (for 
example WO 93/13663, WO 98/01571, WO 98/01546, WO 98/49315, Kato, Y. et al., 2002). 
Likewise, WO 01/79520 describes methods for obtaining erythronolides with alternative 
sugars attached at the 3-position. Therefore, the term "compounds of the invention" includes 
all such non-natural aglycone compounds as described previously, additionally with alternative 
sugars at the C-5 position. All these documents are incorporated herein by reference. 

It will readily occur to those skilled in the art that the compounds of the invention 
containing a mycaminosyl moiety at the C-5 position could be modified at the C4 hydroxyl 
group of the mycaminosyl moiety, including but not limited to glycosylation (see also WO 
01/79520), acylation or chemical modification. 

The present invention thus provides variants of erythromycin and related macrolides 
having at the 5-position a non-naturally occurring sugar, in particular an O-mycaminosyl or 
O-angolosaminyl residue or a derivative or precursor thereof, specifically an O-angolosaminyl 
residue or a derivative thereof. 

The term "variants of erythromycin" encompasses (a) erythromycins A, B, C and D; 
(b) semi-synthetic derivatives such as azithromycin and other derivatives as discussed in EP 
1024145, which is incorporated herein by reference; (c) variants produced by genetic 
engineering and semi-synthetic derivatives thereof. Variants produced by genetic engineering 
include variants as taught in, or producible by, methods taught in WO 98/01571, EP 1024145, 
WO 93/13663, WO 98/49315 and WO 01/79520, which are incorporated herein by reference. 
The compounds of the invention include variants of erythromycin where the natural sugar at 
position C5 has been replaced with mycaminose or angolosamine and also include 
compounds of the following formula (I) and pharmaceutically acceptable salts thereof. No 
stereochemistry is shown in Formula I as all possibilities are covered, including "natural" 
stereochemistries (as shown elsewhere in this specification) at some or all positions. 

Formula I: 

R1 = H, CH3, C2H5 or selected from i) see below 
R2, R4, R5, R6, R7 and R9 are each independently H, OH, CH3, C2H5 or OCH3 
R3 = H or OH 
R8 = H or [structure bearing OR10, not reproduced] or selected from iv) see below 
R10 = H or CH3 or acyl 
R11 = H or [structure bearing OR10 and OR12, not reproduced] 
R12 = H or acyl 
R13 = H or CH3 
R15 = [structure bearing R16, NMe2 and OR14, not reproduced] 
R16 = H or OH 

R14 = H or -C(O)NRcRd wherein each of Rc and Rd is independently H, C1-C10 alkyl, C2-C20 
alkenyl, C2-C10 alkynyl, -(CH2)m(C6-C10 aryl), or -(CH2)m(5-10 membered heteroaryl), 
wherein m is an integer ranging from 0 to 4, and wherein each of the foregoing Rc and Rd 
groups, except H, may be substituted by 1 to 3 Q groups; or wherein Rc and Rd may be taken 
together to form a 4-7 membered saturated ring or a 5-10 membered heteroaryl ring, wherein 
said saturated and heteroaryl rings may include 1 or 2 heteroatoms selected from O, S and N, 
in addition to the nitrogen to which Rc and Rd are attached, and said saturated ring may 
include 1 or 2 carbon-carbon double or triple bonds, and said saturated and heteroaryl rings 
may be substituted by 1 to 3 Q groups; or R2 and R17 taken together form a carbonate ring; 
each Q is independently selected from halo, cyano, nitro, trifluoromethyl, azido, -C(O)Q1, 
-OC(O)Q1, -C(O)OQ1, -OC(O)OQ1, -NQ2C(O)Q3, -C(O)NQ2Q3, -NQ2Q3, hydroxy, C1-C6 
alkyl, C1-C6 alkoxy, -(CH2)m(C6-C10 aryl), and -(CH2)m(5-10 membered heteroaryl), wherein 
m is an integer ranging from 0 to 4, and wherein said aryl and heteroaryl substituents may be 
substituted by 1 or 2 substituents independently selected from halo, cyano, nitro, 
trifluoromethyl, azido, -C(O)Q1, -C(O)OQ1, -OC(O)OQ1, -NQ2C(O)Q3, -C(O)NQ2Q3, 
-NQ2Q3, hydroxy, C1-C6 alkyl, and C1-C6 alkoxy; 

each Q1, Q2 and Q3 is independently selected from H, OH, C1-C10 alkyl, C1-C6 alkoxy, 
C2-C10 alkenyl, C2-C10 alkynyl, -(CH2)m(C6-C10 aryl), and -(CH2)m(5-10 membered 
heteroaryl), wherein m is an integer ranging from 0 to 4; with the proviso that the compound 
is not 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A or D. 

The present invention also provides compounds according to formula I above in which: 
i) the substituent R1 is selected from 

- an alpha-branched C3-C8 group selected from alkyl, alkenyl, alkynyl, 
alkoxyalkyl and alkylthioalkyl groups any of which may be optionally 
substituted by one or more hydroxyl groups; 

- a C5-C8 cycloalkylalkyl group wherein the alkyl group is an alpha-branched 
C2-C5 alkyl group 

- a C3-C8 cycloalkyl group or C5-C8 cycloalkenyl group, either of which may 
optionally be substituted by one or more hydroxyl, or one or more C1-C4 
alkyl groups or halo atoms 

- a 3 to 6 membered oxygen or sulphur containing heterocyclic ring which may 
be saturated, or fully or partially unsaturated and which may optionally be 
substituted by one or more C1-C4 alkyl groups, halo atoms or hydroxyl groups. 

- phenyl which may be optionally substituted with at least one substituent 
selected from C1-C4 alkyl, C1-C4 alkoxy and C1-C4 alkylthio groups, halogen 
atoms, trifluoromethyl, and cyano or 

- R1 is R17-CH2- where R17 is H, C1-C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, 
alkoxyalkyl or alkylthioalkyl containing from 1 to 6 carbon atoms in each 
alkyl or alkoxy group wherein any of said alkyl, alkoxy, alkenyl or alkynyl 
groups may be substituted by one or more hydroxyl groups or by one or more 
halo atoms; or a C3-C8 cycloalkyl or C5-C8 cycloalkenyl either of which may 
be optionally substituted by one or more C1-C4 alkyl groups or halo atoms; or 
a 3 to 6 membered oxygen or sulphur containing heterocyclic ring which may 
be saturated or fully or partially unsaturated and which may optionally be 
substituted by one or more C1-C4 alkyl groups or halo atoms; or a group of 
the formula SA16 wherein A16 is C1-C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, 
C3-C8 cycloalkyl, C5-C8 cycloalkenyl, phenyl or substituted phenyl wherein 
the substituent is C1-C4 alkyl, C1-C4 alkoxy or halo, or a 3 to 6 membered 
oxygen or sulphur-containing heterocyclic ring which may be saturated, or 
fully or partially unsaturated and which may optionally be substituted by one 
or more C1-C4 alkyl groups or halo atoms 

ii) the substituents R2, R4, R5, R6, R7 and R9 are each, independently, H, OH, CH3, 
C2H5, OCH3 

iii) the -CHOH- at C11 (erythromycins) or C12 (azithromycins) is replaced by a 
methylene group (-CH2-), a keto group (C=O), or by a 10,11-olefinic bond 
(erythromycins) or 11,12-olefinic bond (azithromycins) 

iv) R8 includes but is not limited to rhamnose, 2'-O-methyl rhamnose, 2',3'-bis-O-methyl 
rhamnose, 2',3',4'-tri-O-methyl rhamnose, oleandrose, oliose, digitoxose 
or olivose 

v) the substituent R11 is H or mycarose or C4-O-acyl-mycarose or glucose 

The present invention also provides compounds according to formula I above which 
differ in the oxidation state of one or more of the ketide units (i.e. selection of alternatives 
from the group: -CO-, -CH(OH)-, alkene -CH-, and -CH2-) where the stereochemistry of any 
-CH(OH)- is also independently selectable. 

Novel 5-O-dedesosaminyl-5-O-angolosaminyl erythromycins and azithromycins 
made available by this aspect of the invention include, but are not limited to, those where in 
the R15 group R14 = R16 = H, with the proviso that they are not angolamycin or medermycin 
(Kinumaki and Suzuki, 1972; Ichinose et al., 2003). 

Additionally, a person of skill in the art will appreciate that using the methods of the 
present invention mycaminose and angolosamine may be added to other aglycones or 
pseudoaglycones, for example (but without limitation) tylactone or spinosyn pseudoaglycone. 
These other aglycones or pseudoaglycones may be the naturally occurring structure or they 
may be modified in the aglycone backbone. Such modified substrates may be produced by 
chemical semi-synthetic methods (Kaneko et al., 2000 and references cited therein) or, 
alternatively, via PKS engineering; such methods are well known in the art (for example WO 
93/13663, WO 98/01571, WO 98/01546, WO 98/49315, Kato, Y. et al., 2002). 



Moreover, the process of the host cell selection further comprises the optional step of 
deleting or inactivating or adding or manipulating genes in the host cell. This process 
comprises the improvement of recombinant host strains for the preparation and isolation of 
compounds of the invention, in particular 5-O-dedesosaminyl-5-O-mycaminosyl 
erythromycins and 5-O-dedesosaminyl-5-O-mycaminosyl azithromycins, specifically 
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A, 5-O-dedesosaminyl-5-O-mycaminosyl 
erythromycin C, 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B, 
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin D and 5-O-dedesosaminyl-5-O-mycaminosyl 
azithromycin. This approach is exemplified in Example 1 by introducing an eryBVI mutation 
into the chromosome of S. erythraea SGQ2 in order to optimise the conversion of the substrate 
3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl erythromycins. 

In a further aspect the invention relates to the construction of gene cassettes. The 
cloning method used to isolate these gene cassettes is analogous to that used in 
PCT/GB03/003230 and diverges significantly from the approach previously described (WO 
01/79520) by assembling the gene cassette directly in an expression vector rather than 
pre-assembling the genes in pUC18/19 plasmids, thus providing a more rapid cloning procedure 
for the isolation of gene cassettes. The strategy for isolating these gene cassettes is 
exemplified in Example 1 to Example 11. A schematic overview of the strategy is given in 
Figure 2. 

Another aspect of the invention allows the enhancement of gene expression by 
changing the order of genes in a gene cassette, the genes including but not limited to tylMI, 
tylMIII, tylB, eryCVI, tylAI, tylAII, eryCIII, eryBV, angAI, angAII, angMIII, angB, angMI, 
angorf14, angorf4, eryBVI, eryK, eryG, angMII, tylMII, desVII, midI, spnO, spnN, spnP and 
genes with similar functions, allowing the arrangement of the genes in a multitude of 
permutations (Figure 2). 
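
As an informal illustration only, the number of possible gene orders grows factorially with the number of genes in a cassette. The sketch below enumerates the orderings of a small, arbitrarily chosen subset of the genes named above; the function name `cassette_orders` and the choice of subset are assumptions made for illustration and do not describe any cassette actually constructed in the Examples.

```python
from itertools import permutations


def cassette_orders(genes):
    """Return every possible ordering of the given genes as a list of tuples.

    Hypothetical helper for illustration; it only enumerates orderings and
    says nothing about which order gives the best expression.
    """
    return list(permutations(genes))


# Arbitrary three-gene subset chosen for illustration (an assumption).
example_genes = ["angAI", "angAII", "angMIII"]
orders = cassette_orders(example_genes)

for order in orders:
    print(" - ".join(order))
```

For n distinct genes there are n! such orderings, so exhaustive construction is only practical for small cassettes.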

The cloning strategy outlined in this invention also allows the introduction of a 
histidine tag in combination with a terminator sequence 3' of the gene cassette to enhance 
gene expression (see Example 1). Those skilled in the art will appreciate that other terminator 
sequences well known in the art could be used. See, for example, Bussiere and Bastia (1999), 
Bertram et al. (2001) and Kieser et al. (2000), incorporated herein by reference. 

Another aspect of the invention comprises the use of alternative promoters such as 
ptipA (Ali et al., 2002) and/or ptr (Salah-Bey et al., 1995) to express genes and/or assembled 
gene cassette(s) to enhance expression. 

Another aspect of the invention describes the multiple uses of promoter sequences in 
the assembled gene cassette to enhance gene expression as exemplified in Example 6. 

Another aspect of the invention describes the addition of genes encoding an 
NDP-glucose-synthase such as tylAI and an NDP-glucose-4,6-dehydratase such as tylAII to the 
gene cassette in order to enhance the endogenous production of the activated sugar substrate. 
Those skilled in the art will appreciate that alternative sources of equivalent sugar 
biosynthetic pathway genes may be used. In this context alternative sources include but are 
not limited to: 

TylAI homologues: DesIII of Streptomyces venezuelae (accession no AAC68682), 
GrsD of Streptomyces griseus (accession no AAD31799), AveBIII of Streptomyces 
avermitilis (accession no BAA84594), Gtt of Saccharopolyspora spinosa (accession 
no AAK83289), SnogJ of Streptomyces nogalater (accession no AAF01820), AclY of 
Streptomyces galilaeus (accession no BAB72036), LanG of Streptomyces cyanogenus 
(accession no AAD13545), Graorf16 (GraD) of Streptomyces violaceoruber 
(accession no AAA99940), OleS of Streptomyces antibioticus (accession no 
AAD55453) and StrD of Streptomyces griseus (accession no A26984) and AngAI of 
S. eurythermus. 

TylAII homologues: AprE of Streptomyces tenebrarius (accession no AAG18457), 
GdH of S. spinosa (accession no AAK83290), DesIV of S. venezuelae (accession no 
AAC68681), GdH of S. erythraea (accession no AAA68211), AveBII of S. 
avermitilis (accession no BAA84593), Scf81.08C of Streptomyces coelicolor 
(accession no CAB61555), LanH of S. cyanogenus (accession no AAD13546), 
Graorf17 (GraE) of S. violaceoruber (accession no S58686), OleE of S. antibioticus 
(accession no AAD55454), StrE of S. griseus (accession no P29782) and AngAII of 
S. eurythermus. 

Similarly, alternative sources for activated sugar biosynthesis gene homologues to 
tylMIII, angAIII, eryCII, tylMII, angMII, tylB, angB, eryCI, tylMI, angM, eryCVI, tylIa, 
angorf14, angorf4, spnO, eryBVI, eryBV, eryCIII, desVII, midI, spnN and spnP will readily 
occur to those skilled in the art, and can be used. 

Another aspect of the invention describes the use of alternative glycosyltransferases 
in the gene cassettes such as EryCIII. Those skilled in the art will appreciate that alternative 
glycosyltransferases may be used. In this context alternative glycosyltransferases include but 
are not limited to: TylMII (Accession no CAA57472), DesVII (Accession no AAC68677), 
MegCIII (Accession no AAG13921), MegDI (Accession no AAG13908) or AngMII of S. 
eurythermus. 

In one aspect of the present invention, the gene cassette may additionally comprise a 
chimeric glycosyltransferase (GT). This is particularly of benefit where the natural GT does 
not recognise the combination of sugar and aglycone that is required for the synthesis of the 
desired analogue. Therefore, in this aspect the present invention specifically contemplates the 
use of a chimeric GT wherein part of the GT is specific for the recognition of the sugar 
whose synthesis is directed by the genes in said expression cassette when expressed in an 
appropriate strain background and part of the GT is specific for the aglycone or 
pseudoaglycone template (Hu and Walker, 2002). 

Those skilled in the art will appreciate that different strategies may be used for the 
introduction of gene cassettes into the host strain, such as site-specific integration vectors 
(Smovkina et al., 1990; Lee et al., 1991; Matsuura et al., 1996; Van Mellaert et al., 1998; 
Kieser et al., 2000). Alternatively, plasmids containing the gene cassettes may be integrated 
into any neutral site on the chromosome using homologous recombination sites. Further, for 
a number of actinomycete host strains, including S. erythraea, the gene cassettes may be 
introduced on self-replicating plasmids (Kieser et al., 2000; WO 98/01571). 

A further aspect of the invention provides a process for the production of compounds 
of the invention and optionally for the isolation of said compounds. 

A further aspect of the invention is the use of different fermentation methods to
optimise the production of the compounds of the invention as exemplified in Example 1.
Another aspect of the invention is the addition of ery genes such as eryK and/or eryG into the
gene cassette. One skilled in the art will appreciate that the process can be optimised for the
production of a specific erythromycin (i.e. A, B, C, D) or azithromycin by manipulation of the
genes eryG (responsible for the methylation on the mycarose sugar) and/or eryK (responsible
for hydroxylation at C-12). Thus, to optimise the production of the A-form, an extra copy of
eryK may be included in the gene cassette. Conversely, if the erythromycin B analogue is
required, this can be achieved by deletion of the eryK gene from the S. erythraea host strain,
or by working in a heterologous host in which the gene and/or its functional homologue is
not present. Similarly, if the erythromycin D analogue is required, this can be achieved by
deletion of both eryG and eryK genes from the S. erythraea host strain, or by working in a
heterologous host in which both genes and/or their functional homologues are not present.
Similarly, if the erythromycin C analogue is required, this can be achieved by deletion of the
eryG gene from the S. erythraea host strain, or by working in a heterologous host in which the
gene and/or its functional homologues are not present.
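The relationship between the eryK/eryG genotype and the expected erythromycin form described above can be summarised as a small lookup. The following Python sketch is illustrative only; the function name and the boolean flags are assumptions introduced here, and it ignores incomplete conversion, which in practice gives mixtures of forms.

```python
def erythromycin_form(has_eryK: bool, has_eryG: bool) -> str:
    """Return the expected erythromycin form (A, B, C or D) for a host/cassette.

    has_eryK: a functional C-12 hydroxylase (EryK or homologue) is present.
    has_eryG: a functional mycarosyl O-methyltransferase (EryG or homologue) is present.
    """
    if has_eryK and has_eryG:
        return "A"  # hydroxylated at C-12 and methylated on the mycarose
    if has_eryG:
        return "B"  # methylated only (eryK deleted or absent)
    if has_eryK:
        return "C"  # hydroxylated only (eryG deleted or absent)
    return "D"      # neither modification (eryK and eryG deleted or absent)


if __name__ == "__main__":
    for k in (True, False):
        for g in (True, False):
            print(f"eryK={k!s:5} eryG={g!s:5} -> erythromycin {erythromycin_form(k, g)}")
```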

In this context a preferred host cell strain is a mammalian cell strain, a fungal cell
strain or a prokaryote. More preferably the host cell strain is Pseudomonas, myxobacteria or
E. coli. In a more preferred embodiment the host cell strain is an actinomycete, still more
preferably including, but not limited to Saccharopolyspora erythraea, Streptomyces
coelicolor, Streptomyces avermitilis, Streptomyces griseofuscus, Streptomyces
cinnamonensis, Streptomyces fradiae, Streptomyces eurythermus, Streptomyces
longisporoflavus, Streptomyces hygroscopicus, Saccharopolyspora spinosa, Micromonospora
griseorubida, Streptomyces lasaliensis, Streptomyces venezuelae, Streptomyces antibioticus,
Streptomyces lividans, Streptomyces rimosus, Streptomyces albus, Amycolatopsis
mediterranei, Nocardia sp, Streptomyces tsukubaensis and Actinoplanes sp. N902-109. In a
still more preferred embodiment the host cell strain is selected from Saccharopolyspora
erythraea, Streptomyces griseofuscus, Streptomyces cinnamonensis, Streptomyces albus,
Streptomyces lividans, Streptomyces hygroscopicus sp., Streptomyces hygroscopicus var.
ascomyceticus, Streptomyces longisporoflavus, Saccharopolyspora spinosa, Streptomyces
tsukubaensis, Streptomyces coelicolor, Streptomyces fradiae, Streptomyces rimosus,
Streptomyces avermitilis, Streptomyces eurythermus, Streptomyces venezuelae, Amycolatopsis
mediterranei. In the most highly preferred embodiment the host strain is Saccharopolyspora
erythraea.

The present invention provides methods for the production and isolation of
compounds of the invention, in particular of erythromycin and azithromycin analogues which
differ from the natural compound in the glycosylation of the C-5 position, for example but
without limitation: novel 5-O-dedesosaminyl-5-O-mycaminosyl or angolosaminyl
erythromycins and 5-O-dedesosaminyl-5-O-mycaminosyl or angolosaminyl azithromycins
which are useful as anti-microbial agents for use in human or animal health.

In further aspects the present invention provides novel products as obtainable by any
of the processes disclosed herein.

Brief description of Figures 

Figure 1A: Structures of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A,
5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B and 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin C.

Figure 1B: Structure of 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin.

Figure 2: Schematic overview of the gene cassette cloning strategy. Vector pSG144
was derived from vector pSG142 (Gaisser et al., 2000). Abbreviations: dam⁻: DNA isolated
from a dam⁻ strain background; XbaI*: XbaI site sensitive to Dam methylation; eryRHS: DNA
fragment of the right hand side of the ery-cluster as described previously (Gaisser et al.,
2000).

Figure 3: Amino acid comparison between the published sequence of TylAI (below) and the
amino acid sequence detected from the sequencing data described in this invention (above).
The changes in the amino acid sequence are underlined.

Figure 4: Amino acid comparison between the published sequence of TylAII (below) and the
amino acid sequence detected from the sequencing data described in this invention (above).
The changes in the amino acid sequence are underlined.

Figure 5: Structure of 5-O-angolosaminyl tylactone.

Figure 6: Shows an overview of the angolamycin polyketide synthase gene cluster.

Figure 7: The DNA sequence which comprises orf14 and orf15 (angB) from the
angolamycin gene cluster.

Figure 8: The DNA sequence which comprises orf2 (angAI), orf3 (angAII) and orf4
from the angolamycin gene cluster.

Figure 9: The DNA sequence which comprises orf1* (angMIII), orf2* (angMII) and
orf3* (angMI) from the angolamycin gene cluster.

Figure 10: The amino acid sequence which corresponds to orf2 (angAI).

Figure 11: The amino acid sequence which corresponds to orf3 (angAII).

Figure 12: The amino acid sequence which corresponds to orf4.

Figure 13: The amino acid sequence which corresponds to orf14.

Figure 14: The amino acid sequence which corresponds to orf15 (angB).

Figure 15: The amino acid sequence which corresponds to orf1* (angMIII).

Figure 16: The amino acid sequence which corresponds to orf2* (angMII).

Figure 17: The amino acid sequence which corresponds to orf3* (angMI).



General Methods 

Escherichia coli XL1-Blue MR (Stratagene), E. coli DH10B (GibcoBRL) and E. coli
ET12567 were grown in 2xTY medium as described by Sambrook et al. (1989). Vectors
pUC18, pUC19 and Litmus 28 were obtained from New England Biolabs. E. coli
transformants were selected with 100 µg/ml ampicillin. Conditions used for growing the
Saccharopolyspora erythraea NRRL 2338-red variant strain were as described previously
(Gaisser et al., 1997; Gaisser et al., 1998). Expression vectors in S. erythraea were derived
from plasmid pSG142 (Gaisser et al., 2000). Plasmid-containing S. erythraea were selected
with 25-40 µg/ml thiostrepton or 50 µg/ml apramycin. To investigate the production of
antibiotics, S. erythraea strains were grown in sucrose-succinate medium (Caffrey et al.,
1992) as described previously (Gaisser et al., 1997) and the cells were harvested by
centrifugation. Chromosomal DNA of Streptomyces rochei ATCC21250 was isolated using
standard procedures (Kieser et al., 2000). Feedings of 3-O-mycarosyl erythronolide B or
tylactone were carried out at concentrations between 25 and 50 mg/l.

DNA manipulation and sequencing 

DNA manipulations, PCR and electroporation procedures were carried out as
described in Sambrook et al. (1989). Protoplast formation and transformation procedures of
S. erythraea were as described previously (Gaisser et al., 1997). Southern hybridizations were
carried out with probes labelled with digoxigenin using the DIG DNA labelling kit
(Boehringer Mannheim). DNA sequencing was performed as described previously (Gaisser et
al., 1997), using automated DNA sequencing on double stranded DNA templates with an ABI
Prism 3700 DNA Analyzer. Sequence data were analysed using standard programs.

Extraction and mass spectrometry 

1 ml of each fermentation broth was harvested and the pH was adjusted to pH 9. For 
extractions an equal volume of ethyl acetate, methanol or acetonitrile was added, mixed for at 
least 30 min and centrifuged. For extractions with ethyl acetate, the organic layer was 
evaporated to dryness and then re-dissolved in 0.5 ml methanol. For methanol and acetonitrile 
extractions, supernatant was collected after centrifugation and used for analysis. High 
resolution spectra were obtained on a Bruker BioApex II FT-ICR (Bruker, Bremen, FRG).

Analysis of culture broths

An aliquot of whole broth (1 ml) was shaken with CH3CN (1 ml) for 30 minutes. The mixture
was clarified by centrifugation and the supernatant analysed by LCMS. The HPLC system
comprised an Agilent HP1100 equipped with a Luna 5 µm C18 BDS 4.6 x 250 mm column
(Phenomenex, Macclesfield, UK) heated to 40°C. The gradient elution was from 25% mobile
phase B to 75% mobile phase B over 19 minutes at a flow rate of 1 ml/min. Mobile phase A
was 10% acetonitrile: 90% water, containing 10 mM ammonium acetate and 0.15% formic
acid; mobile phase B was 90% acetonitrile: 10% water, containing 10 mM ammonium acetate
and 0.15% formic acid. The HPLC system described was coupled to a Bruker Daltonics
Esquire3000 electrospray mass spectrometer operating in positive ion mode.
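As a worked illustration of the analytical gradient just described, the following Python sketch computes the percentage of mobile phase B at a given time and the resulting overall acetonitrile fraction. The function names are introduced here for illustration, and holding the composition constant before 0 min and after 19 min is an assumption, since the text does not state what happens outside the gradient window.

```python
def percent_b(t_min: float, start_b: float = 25.0, end_b: float = 75.0,
              duration_min: float = 19.0) -> float:
    """Percentage of mobile phase B at time t for a linear gradient."""
    if t_min <= 0:
        return start_b          # assumed: initial composition held before the run
    if t_min >= duration_min:
        return end_b            # assumed: final composition held after the gradient
    return start_b + (end_b - start_b) * t_min / duration_min


def acetonitrile_fraction(pct_b: float) -> float:
    """Overall acetonitrile volume fraction; A is 10% and B is 90% acetonitrile."""
    frac_b = pct_b / 100.0
    return 0.10 * (1.0 - frac_b) + 0.90 * frac_b


if __name__ == "__main__":
    for t in (0, 5, 9.5, 15, 19):
        b = percent_b(t)
        print(f"t = {t:>4} min: {b:5.1f}% B, {100 * acetonitrile_fraction(b):5.1f}% acetonitrile")
```

Under these assumptions the run starts at 30% acetonitrile and ends at 70% acetonitrile overall.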

Extraction and purification protocol

For NMR analysis of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A the
fermentation broth was clarified by centrifugation to provide supernatant and cells. The
supernatant was applied to a column (16 x 15 cm) of Diaion® HP20 resin (Supelco), washed
with 10% Me2CO/H2O (2 x 2 l) and then eluted with Me2CO (3.5 l). The cells were mixed to
homogeneity with an equal volume of Me2CO/MeOH (1:1). After at least 30 minutes the
slurry was clarified by centrifugation and the supernatant decanted. The pelleted cells were
similarly extracted once more with Me2CO/MeOH (1:1). The cell extracts were combined
with the Me2CO from the HP20 column and the solvent was removed in vacuo to give an
aqueous concentrate. The aqueous concentrate was extracted with EtOAc (3 x) and the solvent
removed in vacuo to give a crude extract. The residue was dissolved in CH3CN/MeOH and
purified by repeated rounds of reverse phase (C18) high performance liquid chromatography
using a Gilson HPLC, eluting a Phenomenex 21.2 x 250 mm Luna 5 µm C18 BDS column at 21
ml/min. Elution with a linear gradient of 32.5% B to 63% B was used to concentrate the
macrolides followed by isocratic elution with 30% B to resolve the individual erythromycins.
Mobile phase A was 20 mM ammonium acetate and mobile phase B was acetonitrile.
High resolution mass spectra were acquired on a Bruker BioApex II FTICR (Bruker, Bremen,
Germany).


For NMR analysis of 5-O-angolosaminyl tylactone, bioconversion experiments were
performed as previously described with four 2 l flasks each containing 400 ml of SSDM
medium inoculated with 5% of pre-cultures. Feedings with tylactone were carried out at 50
mg/l. The culture was centrifuged and the pH of the supernatant was adjusted to about pH 9
followed by extractions with three equal volumes of ethyl acetate. The cell pellet was
extracted twice with equal volumes of a mixture of acetone-methanol (50:50, vol/vol). The
extracts were combined and concentrated in vacuo. The resulting aqueous fraction was
extracted three times with ethyl acetate and the extracts were combined and evaporated to
dryness. This semi-purified extract was dissolved in methanol and purified by preparative
HPLC on a Gilson 315 system using a 21 mm x 250 mm Prodigy ODS3 column
(Phenomenex, Macclesfield, UK). The mobile phase was pumped at a flow rate of 21 ml/min
as a binary system consisting of 30% CH3CN, 70% H2O increasing linearly to 70% CH3CN
over 20 min.



Sequence Information 

Table 7 - Sequence information for the angolosamine biosynthetic genes included in the
cassettes

Gene (named according   Bases in Figure               Corresponding   Potential function
to tyl equivalent)                                    polypeptide
                                                      Figure number

orf2 (angAI)            14847-15731c from Figure 8    Figure 10       NDP-hexose synthase
orf3 (angAII)           [illegible] from Figure 8     Figure 11       NDP-hexose 4,6-dehydratase
orf4                    11306-13666c from Figure 8    Figure 12
  (N-part)                                                            type II thioesterase
  (C-part)                                                            NDP-hexose 2,3-dehydratase
orf14                   1162-2160c from Figure 7      Figure 13       NDP-hexose 4-ketoreductase
orf15 (angB)            33-1151c from Figure 7        Figure 14       NDP-hexose aminotransferase
orf1* (angMIII)         59800-61140 from Figure 9     Figure 15       Hypothetical NDP-hexose
                                                                      3,4-isomerase
orf2* (angMII)          61159-62430 from Figure 9     Figure 16       angolosaminyl glycosyl
                                                                      transferase
orf3* (angMI)           62452-63171 from Figure 9     Figure 17       N,N-dimethyl transferase

Note: c indicates that the gene is encoded by the complement DNA strand.
Potential functions of the predicted polypeptides (SEQ ID No.7 to 34) were obtained
from the NCBI database using a BLAST search.

Example 1: Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-
dedesosaminyl-5-O-mycaminosyl erythromycins using gene cassette
pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII

Isolation of pSG143 

Plasmid pSG142 (Gaisser et al., 2000) was digested with XbaI and a fill-in reaction
was performed using standard protocols. The DNA was re-ligated and used to transform E.
coli DH10B. Construct pSG143 was isolated and the removal of the XbaI site was confirmed
by sequence analysis.

Isolation of pUC18eryBVcas

The gene eryBV was amplified by PCR using the primers cas01eG21 (WO 01/79520)
and 7966 5'-GGGGAATTCAGATCTGGTCTAGAGGTCAGCCGGCGTGGCGGCGCGTG
AGTTCCTCCAGTCGCGGGACGATCT-3' and pSG142 (Gaisser et al., 2000) as template.
The PCR fragment was cloned using standard procedures and plasmid pUC18eryBVcas was
isolated with an NdeI site overlapping the start codon of eryBV and XbaI and BglII sites
following the stop codon. The construct was verified by sequence analysis.
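The restriction sites engineered into a primer such as 7966 can be located with a simple string search. The Python sketch below is illustrative only: it assumes the primer reads ...CAGATCTGGTCTAGAGG... (i.e. that it carries intact BglII and XbaI sites, as the text states), and the helper name `find_sites` is introduced here. The recognition sequences used are the standard ones for these enzymes.

```python
# Standard recognition sequences of the enzymes named in the text.
SITES = {
    "EcoRI": "GAATTC",
    "BglII": "AGATCT",
    "XbaI": "TCTAGA",
    "NdeI": "CATATG",
}

# Primer 7966 as read from the text (assumed reading, see lead-in).
PRIMER_7966 = (
    "GGGGAATTCAGATCTGGTCTAGAGGTCAGCCGGCGTGGCGGCGCGTG"
    "AGTTCCTCCAGTCGCGGGACGATCT"
)


def find_sites(seq: str) -> dict:
    """Return 0-based start positions of each recognition site in seq."""
    seq = seq.upper()
    hits = {}
    for enzyme, site in SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)  # allow overlapping matches
        hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for enzyme, positions in find_sites(PRIMER_7966).items():
        print(f"{enzyme:6s} {positions}")
```

Because 7966 is the reverse primer, sites near its 5' end lie downstream of the stop codon in the amplified product.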

Isolation of vector pSGLit1

The isolation of this vector is described in PCT/GB03/003230.

Isolation of pSGLit1eryCIII

Plasmid pSGCIII (WO 01/79520) was digested with NdeI/BglII and the insert fragment was
isolated and ligated with the NdeI/BglII treated vector fragment of pSGLit1. The ligation was
used to transform E. coli ET12567 and plasmid pSGLit1eryCIII was isolated using standard
procedures. The construct was confirmed using restriction digests and sequence analysis. This
cloning strategy allows the introduction of a his-tag C-terminal of EryCIII.

Isolation of pSGLit1tylMII

Plasmid pSGTYLM2 (WO 01/79520) was digested with NdeI/BglII and the insert fragment was
isolated and ligated with the NdeI/BglII treated vector fragment of pSGLit1. The ligation was
used to transform E. coli ET12567 and plasmid pSGLit1tylMII was isolated using standard
procedures. The construct was confirmed using restriction digests and sequence analysis. This
cloning strategy allows the introduction of a his-tag C-terminal of TylMII.

Isolation of pSG144

Plasmid pSGLit1 was isolated and digested with NdeI/BglII and an approximately 1.3
kb insert was isolated. Plasmid pSG143 was digested with NdeI/BglII, the vector band was
isolated and ligated with the approximately 1.3 kb band from pSGLit1 followed by
transformation of E. coli DH10B. Plasmid pSG144 (Figure 2) was isolated and the construct
was verified by DNA sequence analysis. This vector allows the assembly of gene cassettes
directly in an expression vector (Figure 2) without prior assembly in pUC-derived vectors
(WO 01/79520) in analogy to PCT/GB03/003230 using vector pSG144 instead of pSGset1.
Plasmid pSG144 differs from pSG142 in that the XbaI site between the thiostrepton resistance
gene and the eryRHS has been deleted and the his-tag at the end of eryBV has been removed
from pSG142 and replaced in pSG144 with an XbaI site at the end of eryBV. This is to
facilitate direct cloning of genes to replace eryBV and then build up the cassette.

Isolation of pSG144eryCIII

eryCIII was amplified by PCR using standard protocols, with primers
cas01eG21 (WO 01/79520) and caseryCIII2 (WO 01/79520) and plasmid pSGCIII (Gaisser et
al., 2000) as template. The approximately 1.3 kb PCR product was isolated and cloned into
pUC18 using standard techniques. Plasmid pUCCIIIcass was isolated and the sequence was
verified. The insert fragment of plasmid pUCCIIIcass was isolated after NdeI/XbaI digestion
and ligated with the NdeI/XbaI digested vector fragment of pSG144. After the transformation
of E. coli DH10B plasmid pSG144eryCIII was isolated using standard techniques.

Isolation of pUC19tylAI

Primers BIOSG34 5'-
GGGCATATGAACGACCGTCCCCGCCGCGCCATGAAGGG-3' and 5'-
CCCCTCTAGAGGTCACTGTGCCCGGCTGTCGGCGGCGGCCCCGCGCATGG-3' were
used with genomic DNA of Streptomyces fradiae as template to amplify tylAI. The amplified
product was cloned using standard protocols and plasmid pUC19tylAI was isolated. The insert
was verified by DNA sequence analysis. Differences to the published sequence are shown in
Figure 3.

Isolation of pSGLit2

Plasmid Litmus 28 was digested with SpeI/XbaI and the vector fragment was isolated.
Plasmid pSGLit1 (dam⁻) was digested with XbaI and the insert band was isolated and ligated
with the SpeI/XbaI digested vector fragment of Litmus 28 followed by the transformation of
E. coli DH10B using standard techniques. Plasmid pSGLit2 was isolated and the construct
was verified by restriction digest and sequence analysis. This plasmid can be used to add a 5'
region containing an XbaI site sensitive to Dam methylation and a Shine Dalgarno region, thus
converting genes which were originally cloned with an NdeI site overlapping the start codon
and an XbaI site 3' of the stop codon for the assembly of gene cassettes. This conversion
includes the transformation of the ligations into E. coli ET12567 followed by the isolation of
dam⁻ DNA and XbaI digests. Examples for this strategy are outlined below.
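An XbaI site (TCTAGA) is generally understood to be blocked by Dam methylation when it overlaps a GATC motif, which happens when the site is preceded by GA or followed by TC. The Python sketch below flags such sites; it is a minimal illustration, the function name is an assumption introduced here, and the two example sequences are the 5' primer listed in this document for tylAII (which carries the XbaI, PacI and Shine Dalgarno leader) and primer 7966.

```python
XBAI = "TCTAGA"


def xbai_dam_status(seq: str) -> list:
    """Return (position, dam_sensitive) for every XbaI site in seq.

    A site is flagged dam-sensitive when it overlaps a Dam target (GATC),
    i.e. GA immediately upstream (GATCTAGA) or TC immediately downstream
    (TCTAGATC).
    """
    seq = seq.upper()
    results = []
    pos = seq.find(XBAI)
    while pos != -1:
        upstream = seq[pos - 2:pos] if pos >= 2 else ""
        downstream = seq[pos + 6:pos + 8]
        results.append((pos, upstream == "GA" or downstream == "TC"))
        pos = seq.find(XBAI, pos + 1)
    return results


# 5' primer used for tylAII: XbaI site followed by TC, so it overlaps GATC.
TYLAII_FORWARD = (
    "GGGTCTAGATCGATTAATTAAGGAGGACATTCATGCGCGTCCTGGTGACCGGAGG"
    "TGCGGGCTTCATCGGCTCGCACTTCA"
)
# Primer 7966: XbaI site flanked by GG on both sides, no GATC overlap.
PRIMER_7966 = (
    "GGGGAATTCAGATCTGGTCTAGAGGTCAGCCGGCGTGGCGGCGCGTG"
    "AGTTCCTCCAGTCGCGGGACGATCT"
)

if __name__ == "__main__":
    print("tylAII 5' primer:", xbai_dam_status(TYLAII_FORWARD))
    print("primer 7966:     ", xbai_dam_status(PRIMER_7966))
```

This is consistent with the strategy in the text: the upstream XbaI site of each converted gene can only be cut in DNA prepared from a dam⁻ host such as E. coli ET12567, whereas the XbaI site 3' of the stop codon is cut regardless of the host.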

Isolation of pSGLit2tylAI

Plasmid pSGLit2 and pUC19tylAI were digested with NdeI/XbaI and the insert band
of pUC19tylAI and the vector band of pSGLit2 were isolated, ligated and used to transform E.
coli ET12567. Plasmid pSGLit2tylAI (dam⁻) was isolated.

Isolation of pUC19tylAII

Primers 5'-
CCCCTCTAGAGGTCATGCGCGCTCCAGTTCCCTGCCGCCCGGGGACCGCTTG-3'
and 5'-
GGGTCTAGATCGATTAATTAAGGAGGACATTCATGCGCGTCCTGGTGACCGGAGG
TGCGGGCTTCATCGGCTCGCACTTCA-3' and genomic DNA of Streptomyces fradiae as
template were used for a PCR reaction applying standard protocols to amplify tylAII. The
approximately 1 kb DNA fragment was isolated and cloned into SmaI-cut pUC19 using
standard techniques. The DNA sequencing of this construct revealed that 12 nucleotides at the
5' end had been removed, possibly by an exonuclease activity present in the PCR reaction.
The comparison of the amino acid sequence of the cloned fragment with the published
sequence is shown in Figure 4.

Isolation of pSGLit2tylAII

To add the missing 5'-nucleotides, pSGLit2 was digested with PacI/XbaI and the
vector fragment was isolated and ligated with the PacI/XbaI digested insert fragment of
pUC19tylAII. The ligated DNA was used to transform E. coli ET12567 and plasmid
pSGLit2tylAII (dam⁻) was isolated.

Isolation of plasmid pUC19eryCVI

The eryCVI gene was amplified by PCR using primer BIOSG28 5'-
GGGCATATGTACGAGGGCGGGTTCGCCGAGCTTTACGACC-3' and BIOSG29 5'-
GGGGTCTAGAGGTCATCCGCGCACACCGACGAACAACCCG-3' and plasmid
pNC062 (Gaisser et al., 1997) as a template. The PCR product was cloned into SmaI digested
pUC19 using standard techniques and plasmid pUC19eryCVI was isolated and verified by
sequence analysis.

Isolation of plasmid pSGLit2eryCVI 

Plasmid pUC19eryCVI was digested with NdeI/XbaI and ligated with the NdeI/XbaI
digested vector fragment of pSGLit2 followed by transformation of E. coli ET12567. Plasmid
pSGLit2eryCVI (dam⁻) was isolated.

Isolation of plasmid pSG144tylAI 

Plasmid pSG144 and pUC19tylAI were digested with NdeI/XbaI and the insert band of
pUC19tylAI and the vector band of pSG144 were isolated, ligated and used to transform E.
coli DH10B. Plasmid pSG144tylAI was isolated using standard protocols.

Isolation of plasmid pSG144tylAItylAII

Plasmid pSGLit2tylAII (dam⁻) was digested with XbaI and ligated with XbaI digested plasmid
pSG144tylAI. The ligation was used to transform E. coli DH10B and plasmid
pSG144tylAItylAII was isolated and verified using standard protocols.

Isolation of plasmid pSGLit2tylMIII

Plasmid pUC18tylM3 (isolation described in WO 01/79520) was digested with NdeI/XbaI and
the insert band and the vector band of NdeI/XbaI digested pSGLit2 were isolated, ligated and
used to transform E. coli ET12567. Plasmid pSGLit2tylMIII (dam⁻) was isolated using
standard protocols. The construct was verified using restriction digests and sequence analysis.

Isolation of plasmid pSG144tylAItylAIItylMIII 

Plasmid pSGLit2tylMIII (dam⁻) was digested with XbaI and the insert band was ligated with
XbaI digested plasmid pSG144tylAItylAII. The ligation was used to transform E. coli DH10B
and plasmid pSG144tylAItylAIItylMIII no36 was isolated using standard protocols. The
construct was verified using restriction digests and sequence analysis.

Isolation of plasmid pSGLit2tylB

Plasmid pUC18tylB (isolation described in WO 01/79520) was digested with PacI/XbaI and
the insert band and the vector band of PacI/XbaI digested pSGLit2 were isolated, ligated and
used to transform E. coli ET12567. Plasmid pSGLit2tylB no1 (dam⁻) was isolated using
standard protocols.

Isolation of plasmid pSG144tylAItylAIItylMIIItylB

Plasmid pSGLit2tylB (dam⁻) was digested with XbaI and the insert band was ligated with
XbaI digested plasmid pSG144tylAItylAIItylMIII. The ligation was used to transform E. coli
DH10B and plasmid pSG144tylAItylAIItylMIIItylB no5 was isolated using standard protocols
and verified by restriction digests and sequence analysis.

Isolation of plasmid pUC18tylIa

Primers BIOSG88 5'-GGGCATATGGCGGCGAGCACTACGACGGAGGGGAATGT-3'
and BIOSG89 5'-GGGTCTAGAGGTCACGGGTGGCTCCTGCCGGCCCTCAG-3' were
used to amplify tylIa using a plasmid carrying the tyl region (accession number
u08223.eni_pro2) comprising ORF1 (cytochrome P450) to the end of ORF2 (TylB) as a
template. Plasmid pUCtylIa no1 was isolated using standard procedures and the construct was
verified using sequence analysis.


Isolation of plasmid pSGLit2tylIa

Plasmid pUCtylIa no1 was digested with NdeI/XbaI and the insert band and the vector band
of NdeI/XbaI digested pSGLit2 were isolated, ligated and used to transform E. coli ET12567.
Plasmid pSGLit2tylIa no54 (dam⁻) was isolated using standard protocols. The construct was
verified using sequence analysis.

Isolation of plasmid pSG144tylAItylAIItylMIIItylBtylIa

Plasmid pSGLit2tylIa (dam⁻) was digested with XbaI and the insert band was ligated with
XbaI digested plasmid pSG144tylAItylAIItylMIIItylB. The ligation was used to transform E.
coli DH10B and plasmid pSG144tylAItylAIItylMIIItylBtylIa no3 was isolated using standard
protocols and verified by restriction digests and sequence analysis.

Isolation of plasmid pSGLit1tylMIeryCIII

Plasmid pUCtylMI (isolation described in WO 01/79520) was PacI digested and the insert was
ligated with the PacI digested vector fragment of pSGLit1eryCIII using standard procedures.
Plasmid pSGLit1tylMIeryCIII no20 was isolated and the orientation was confirmed by
restriction digests and sequence analysis.

Isolation of gene cassette pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII

Plasmid pSGLit1tylMIeryCIII no20 was digested with XbaI/BglII and the insert band was
isolated and ligated with the XbaI/BglII digested vector fragment of plasmid
pSG144tylAItylAIItylMIIItylBtylIa no3. Plasmid
pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII was isolated using standard procedures and
the construct was confirmed using restriction digests and sequence analysis. Plasmid
preparations were used to transform S. erythraea mutant strains with standard procedures.

Isolation of plasmid pSGKC1

To prevent the conversion of the substrate 3-O-mycarosyl erythronolide B to 3,5-di-O-
mycarosyl erythronolide B a further chromosomal mutation was introduced into S. erythraea
SGQ2 (isolation described in WO 01/79520) to prevent the biosynthesis of L-mycarose in the
strain background. Plasmid pSGKC1 was isolated by cloning the approximately 0.7 kb DNA
fragment of the eryBVI gene by PCR amplification with cosmid 2 or plasmid pGG1
(WO 01/79520) as a template and with the primers 646 5'-
CATCGTCAAGGAGTTCGACGGT-3' and 874 5'-GCCAGCTCGGCGACGTCCATC-
3' using standard protocols. Cosmid 2, containing the right hand side of the ery-cluster, was
isolated from an existing cosmid library (Gaisser et al., 1997) by screening with eryBV as a
probe using standard techniques. The amplified DNA fragment was isolated and cloned into
EcoRV digested pKC1132 (Bierman et al., 1992) using standard methods. The ligated DNA
was used to transform E. coli DH10B and plasmid pSGKC1 was isolated using standard
molecular biological techniques. The construct was verified by DNA sequence analysis.

Isolation of S. erythraea Q42/1 (Biot-2166)

Plasmid pSGKC1 was used to transform S. erythraea SGQ2 using standard techniques
followed by selection with apramycin. Thiostrepton/apramycin resistant transformant S.
erythraea Q42/1 was isolated.

Bioconversion using S. erythraea Q42/1pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII

Bioconversion assays using 3-O-mycarosyl erythronolide B are carried out as described in
General Methods. Improved levels of mycaminosyl erythromycin A are detected in
bioconversion assays using S. erythraea
Q42/1pSG144tylAItylAIItylMIIItylBtylIatylMIeryCIII compared to bioconversion levels
previously observed (WO 01/79520).

Example 2: Isolation of mycaminosyl tylactone using gene cassette
pSG144tylAItylAIItylMIIItylBtylIatylMItylMII

Isolation of plasmid pSGLit1tylMItylMII

Plasmid pUCtylMI (isolation described in WO 01/79520) was PacI digested and the insert was
ligated with the PacI digested vector fragment of pSGLit1tylMII using standard procedures.
Plasmid pSGLit1tylMItylMII no16 was isolated and the construct was confirmed by
restriction digests and sequence analysis.

Isolation of plasmid pSG144tylAItylAIItylMIIItylBtylIatylMItylMII

Plasmid pSGLit1tylMItylMII no16 was digested with XbaI/BglII and the insert band was
isolated and ligated with the XbaI/BglII digested vector fragment of plasmid
pSG144tylAItylAIItylMIIItylBtylIa no3. Plasmid
pSG144tylAItylAIItylMIIItylBtylIatylMItylMII was isolated using standard procedures and the
construct was confirmed using restriction digests and sequence analysis. The plasmid was
isolated and used for transformation of S. erythraea mutant strains using standard protocols.


Bioconversion using gene cassette pSG144tylAItylAIItylMIIItylBtylIatylMItylMII

The conversion of fed tylactone to mycaminosyl tylactone was assessed in bioconversion
assays using S. erythraea Q42/1pSG144tylAItylAIItylMIIItylBtylIatylMItylMII.
Bioconversion assays were carried out using standard protocols (see Chemical Request sheet
81). The analysis of the culture showed the major ion to be 568.8 [M+H]+, consistent with the
presence of mycaminosyl tylactone. Fragmentation of this ion gave a daughter ion of m/z 174,
as expected for protonated mycaminose. No tylactone was detected during the analysis of the
culture extracts, indicating that the bioconversion of the fed tylactone was complete.
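The reported ions can be cross-checked with a short mass calculation. The Python sketch below is illustrative and rests on assumptions not stated in this document: the molecular formulae C23H38O5 for tylactone and C8H17NO4 for mycaminose, standard average atomic masses, and formation of the glycoside with loss of one water. Under those assumptions the m/z 174 daughter ion corresponds to the protonated sugar after loss of water.

```python
# Average atomic masses (g/mol) and the proton mass; standard values.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}
PROTON = 1.00728

# Assumed molecular formulae (not given in the text).
TYLACTONE = {"C": 23, "H": 38, "O": 5}
MYCAMINOSE = {"C": 8, "H": 17, "N": 1, "O": 4}
WATER = {"H": 2, "O": 1}


def mass(formula: dict) -> float:
    """Average molecular mass of a formula given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def glycoside_mh(aglycone: dict, sugar: dict) -> float:
    """[M+H]+ of a glycoside formed from aglycone and sugar with loss of water."""
    return mass(aglycone) + mass(sugar) - mass(WATER) + PROTON


def sugar_fragment_mh(sugar: dict) -> float:
    """m/z of the protonated sugar fragment after loss of water."""
    return mass(sugar) - mass(WATER) + PROTON


if __name__ == "__main__":
    print(f"mycaminosyl tylactone [M+H]+ : {glycoside_mh(TYLACTONE, MYCAMINOSE):.1f}")
    print(f"mycaminose-derived fragment  : {sugar_fragment_mh(MYCAMINOSE):.1f}")
```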
Recently, a homologue of TylIa was identified in the biosynthetic pathway of dTDP-3-
acetamido-3,6-dideoxy-alpha-D-galactose in Aneurinibacillus thermoaerophilus L420-91T
(Pfoestl et al., 2003) and the function was postulated as a novel type of isomerase capable of
synthesizing dTDP-6-deoxy-D-xylohex-3-ulose from dTDP-6-deoxy-D-xylohex-4-ulose.

Example 3: Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-
mycaminosyl erythromycins using gene cassette pSG1448/27/95/21/44/193/6eryCIII
(pSG144angAIangAIIorf14angMIIIangBangMIeryCIII).

Cloning of angMIII by isolating plasmid Lit1/4

The gene angMIII was amplified by PCR using the primers BIOSG61 5'-
GGGCATATGAGCCCCGCACCCGCCACCGAGGACCC-3' and BIOSG62 5'-
GGTCTAGAGGTCAGTTCCGCGGTGCGGTGGCGGGCAGGTCAC-3'. Cosmid5B2
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1.4
kb PCR fragment (PCR no1) was cloned using standard procedures and EcoRV digested
plasmid Litmus28. Plasmid Lit1/4 was isolated with an NdeI site overlapping the start codon
of angMIII and an XbaI site following the stop codon. The construct was verified by sequence
analysis.
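The PCR fragment sizes quoted in this example can be compared with the open reading frame lengths implied by the base ranges in the sequence information table. The Python sketch below is a simple consistency check; the coordinates are those read from the table, the approximate fragment sizes are those quoted in the cloning paragraphs of this example, and the 0.1 kb tolerance is an arbitrary assumption. PCR products are expected to be slightly longer than the ORF because of the primer tails.

```python
# (start, end, reported PCR fragment size in kb); coordinates are inclusive.
ORFS = {
    "angMIII": (59800, 61140, 1.4),
    "angMII": (61159, 62430, 1.3),
    "angMI": (62452, 63171, 0.75),
    "angB": (33, 1151, 1.2),
    "orf14": (1162, 2160, 1.0),
}


def orf_length(start: int, end: int) -> int:
    """Length in bp of an ORF given inclusive start and end coordinates."""
    return end - start + 1


def check_orf(start: int, end: int, reported_kb: float, tolerance_kb: float = 0.1):
    """Return (length_bp, whole_codons, size_matches) for one ORF."""
    length = orf_length(start, end)
    whole_codons = length % 3 == 0
    size_matches = abs(length / 1000.0 - reported_kb) <= tolerance_kb
    return length, whole_codons, size_matches


if __name__ == "__main__":
    for gene, (start, end, kb) in ORFS.items():
        length, codons_ok, size_ok = check_orf(start, end, kb)
        print(f"{gene:8s} {length:5d} bp  whole codons: {codons_ok}  "
              f"matches ~{kb} kb fragment: {size_ok}")
```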

Isolation of plasmid pSGLit21/4 

Plasmid Lit1/4 was digested with NdeI/XbaI and the approximately 1.4 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit21/4 no7 (dam⁻) was isolated. This construct
was digested with XbaI and used for the construction of gene cassettes.

Cloning of angMII by isolating plasmid Lit2/8

The gene angMII was amplified by PCR using the primers BIOSG63 5'-
GGGCATATGCGTATCCTGCTGACGTCGTTCGCGCACAACAC-3' and BIOSG64 5'-
GGTCTAGAGGTCAGGCGCGGCGGTGCGCGGCGGTGAGGCGTTCG-3' with
cosmid5B2 containing a fragment of the angolamycin biosynthetic pathway as
template. The 1.3 kb PCR fragment (PCR no2) was cloned using standard procedures and
EcoRV digested plasmid Litmus28. Plasmid Lit2/8 was isolated with an NdeI site overlapping
the start codon of angMII and an XbaI site following the stop codon. The construct was
verified by sequence analysis.

Cloning of angMII by isolating plasmid pLitangMII(BglII)

The gene angMII was amplified by PCR using primers BIOSG63 5'-
GGGCATATGCGTATCCTGCTGACGTCGTTCGCGCACAACAC-3' and BIOSG80 5'-
GGAGATCTGGCGCGGCGGTGCGCGGCGGTGAGGCGTTCG-3' and cosmid5B2
containing a fragment of the angolamycin biosynthetic pathway as template. The 1.3 kb PCR
fragment was cloned using standard procedures and EcoRV digested plasmid Litmus28.
Plasmid LitangMII(BglII) no8 was isolated with an NdeI site overlapping the start codon of
angMII and a BglII site instead of a stop codon, thus allowing the addition of a his-tag. The
construct was verified by sequence analysis.


Isolation of plasmid pSGLit1angMII

Plasmid LitangMII(BglII) was digested with NdeI/BglII and ligated with the NdeI/BglII
digested vector fragment of pSGLit1. The ligation was used to transform E. coli ET12567 and
plasmid pSGLit1angMII (dam⁻) was isolated using standard procedures.


Cloning of angMI by isolating plasmid Lit3/6

The gene angMI was amplified by PCR using the primers BIOSG65 5'-
GGGCATATGAACCTCGAATACAGCGGCGACATCGCCCGGTTG-3' and BIOSG66 5'-
GGTCTAGAGGTCAGGCCTGGACGCCGACGAAGAGTCCGCGGTCG-3' with
cosmid5B2 containing a fragment of the angolamycin biosynthetic pathway as
template. The 0.75 kb PCR fragment (PCR no3) was cloned using standard procedures and
EcoRV digested plasmid Litmus28. Plasmid Lit3/6 was isolated with an NdeI site overlapping
the start codon of angMI and an XbaI site following the stop codon. The construct was
verified by sequence analysis.


Isolation of plasmid pSGLit23/6 no8

Plasmid Lit3/6 was digested with NdeI/XbaI and the approximately 0.8 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit23/6 no8 (dam⁻) was isolated. This construct
was digested with XbaI and the isolated approximately 1 kb fragment was used for the
assembly of gene cassettes.

Cloning of angB by isolating plasmid Lit4/19

The gene angB was amplified by PCR using the primers BIOSG67 5'-
GGGCATATGACTACCTACGTCTGGGACTACCTGGCGG-3' and BIOSG68 5'-
GGTCTAGAGGTCAGAGCGTGGCCAGTACCTCGTGCAGGGC-3' with cosmid4H2
containing a fragment of the angolamycin biosynthetic pathway as template. The 1.2
kb PCR fragment (PCR no4) was cloned using standard procedures and EcoRV digested
plasmid Litmus28. Plasmid Lit4/19 was isolated with an NdeI site overlapping the start codon
of angB and an XbaI site following the stop codon. The construct was verified by sequence
analysis.

Isolation of plasmid pSGLit24/19

Plasmid Lit4/19 was digested with NdeI/XbaI and the 1.2 kb fragment was isolated and
ligated into NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli
ET12567 and plasmid pSGLit24/19 no24 (dam⁻) was isolated. This construct was digested
with XbaI and the isolated 1.2 kb fragment was used for the assembly of gene cassettes.

Cloning of orf14 by isolating plasmid Lit5/2

The gene orf14 was amplified by PCR using the primers BIOSG69 5'-
GGGCATATGGTGAACGATCCGATGCCGCGCGGCAGTGGCAG-3' and BIOSG70 5'-
GGTCTAGAGGTCAACCTCCAGAGTGTTTCGATGGGGTGGTGGG-3' and cosmid4H2
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 1.0
kb PCR fragment (PCR no5) was cloned using standard procedures and EcoRV digested
plasmid Litmus28. Plasmid Lit5/2 was isolated with an NdeI site overlapping the start codon
of orf14 and an XbaI site following the stop codon. The construct was verified by sequence
analysis.

Isolation of plasmid pSGLit25/2 no24

Plasmid Lit5/2 was digested with NdeI/XbaI and the approximately 1 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit25/2 no24 (dam-) was isolated. This construct
was digested with XbaI and the isolated approximately 1 kb fragment was used for the
assembly of gene cassettes.

Isolation of plasmid pSGLit27/9 no15

Plasmid Lit7/9 was digested with NdeI/XbaI and the approximately 1 kb fragment was
isolated and ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to
transform E. coli ET12567 and plasmid pSGLit27/9 no15 (dam-) was isolated. This construct
was digested with XbaI and the isolated 1 kb fragment was used for the assembly of gene
cassettes.

Cloning of angAI (orf2) by isolating plasmid Lit8/2

The gene angAI was amplified by PCR using the primers BIOSG73 5'-
GGGCATATGAAGGGCATCATCCTGGCGGGCGGCAGCGGC-3' and BIOSG74 5'-
GGTCTAGAGGTCATGCGGCCGGTCCGGACATGAGGGTCTCCGCCAC-3' and
cosmid4H2 containing a fragment of the angolamycin biosynthetic pathway was used as
template. The approximately 1.0 kb PCR fragment (PCR no8) was cloned using standard procedures
and EcoRV digested plasmid Litmus28. Plasmid Lit8/2 was isolated with an NdeI site
overlapping the start codon of angAI and an XbaI site following the stop codon. The construct
was verified by sequence analysis.

Cloning of angAII (orf3) by isolating plasmid Lit7/9

The gene angAII was amplified by PCR using the primers BIOSG71 5'-
GGGCATATGCGGCTGCTGGTCACCGGAGGTGCGGGC-3' and BIOSG72 5'-
GGTCTAGAGGTCAGTCGGTGCGCCGGGCCTCCTGCG-3' and cosmid4H2 containing a
fragment of the angolamycin biosynthetic pathway was used as template. The 1.0 kb PCR
fragment was cloned using standard procedures and EcoRV digested plasmid Litmus28.
Plasmid Lit7/9 was isolated with an NdeI site overlapping the start codon of angAII and an
XbaI site following the stop codon. The construct was verified by sequence analysis.

Isolation of plasmid pSGLit28/2 no18 (pSGLit2angAI)

Plasmid Lit8/2 was digested with NdeI/XbaI and the 1 kb fragment was isolated and ligated to
NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli ET12567
and plasmid pSGLit28/2 no18 (dam-) was isolated.

Isolation of plasmid pSG1448/2 (pSG144angAI)

Plasmid Lit8/2 was digested with NdeI/XbaI and the approximately 1 kb fragment was
isolated and ligated with NdeI/XbaI digested DNA of pSG144. The ligation was used to
transform E. coli DH10B and plasmid pSG1448/2 (dam) (pSG144angAI) was isolated using
standard procedures. This construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/9 (pSG144angAIangAII)

Plasmid pSGLit27/9 (isolated from E. coli ET12567) was digested with XbaI and the 1 kb
fragment was isolated and ligated with the XbaI digested vector fragment of pSG1448/2
(pSG144angAI). The ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/9 (pSG144angAIangAII) was isolated using standard protocols. The construct
was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4 (pSG144angAIangAIIangMIII)

Plasmid pSGLit21/4 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of pSG1448/27/9
(pSG144angAIangAII). The ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/91/4 (pSG144angAIangAIIangMIII) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/44/19 (pSG144angAIangAIIangMIIIangB)

Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the about
1.2 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4 (pSG144angAIangAIIangMIII). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/91/44/19 (pSG144angAIangAIIangMIIIangB) was isolated
using standard protocols. The construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/91/44/193/6 (pSG144angAIangAIIangMIIIangBangMI)

Plasmid pSGLit23/6 (isolated from E. coli ET12567) was digested with XbaI and the about
0.8 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/44/19 (pSG144angAIangAIIangMIIIangB). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/44/193/6
(pSG144angAIangAIIangMIIIangBangMI) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.
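The plasmid names in this section follow a regular pattern: the short name appends the clone identifier of each inserted fragment (8/2, 7/9, 1/4, ...) to the backbone name, and the descriptive name in parentheses appends the corresponding gene. The following minimal Python sketch illustrates that pattern only. The mapping is read off the names in the text, and the function name `cassette_names` is a hypothetical helper introduced for illustration.

```python
# Clone identifiers and the genes they carry, read off the plasmid names in the text.
CLONE_TO_GENE = {
    "8/2": "angAI",
    "7/9": "angAII",
    "1/4": "angMIII",
    "4/19": "angB",
    "3/6": "angMI",
}


def cassette_names(backbone, clone_ids):
    """Return (short name, descriptive name) for a cassette built by
    adding the given clones, in order, to the backbone (hypothetical helper)."""
    short_name = backbone + "".join(clone_ids)
    descriptive_name = backbone + "".join(CLONE_TO_GENE[c] for c in clone_ids)
    return short_name, descriptive_name


if __name__ == "__main__":
    order = ["8/2", "7/9", "1/4", "4/19", "3/6"]
    # Each assembly step adds one more clone to the cassette name.
    for n in range(1, len(order) + 1):
        print(*cassette_names("pSG144", order[:n]))
```

Read this way, a name such as pSG1448/27/91/44/193/6 decomposes into pSG144 followed by the clones 8/2, 7/9, 1/4, 4/19 and 3/6.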

Isolation of plasmid pSG1448/27/91/44/193/6eryCIII
(pSG144angAIangAIIangMIIIangBangMIeryCIII)

Plasmid pSGLit1eryCIII (isolated from E. coli ET12567) was digested with XbaI/BglII and
the about 1.2 kb fragment was isolated and ligated with the XbaI digested and partially BglII
digested vector fragment of pSG1448/27/91/44/193/6
(pSG144angAIangAIIangMIIIangBangMI). The BglII partial digest was necessary due to the
presence of a BglII site in angB. The ligation was used to transform E. coli DH10B and
plasmid pSG1448/27/91/44/193/6eryCIII no9
(pSG144angAIangAIIangMIIIangBangMIeryCIII) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis. EryCIII carries a his-tag
fusion at the end.
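The need for a partial BglII digest can be illustrated with a small calculation. The sketch below, in Python, computes the fragment lengths obtained when a circular plasmid is cut at a given set of positions. All coordinates and the plasmid length are hypothetical values chosen for illustration and are not taken from the constructs described here.

```python
def circular_fragments(plasmid_len, cut_positions):
    """Fragment lengths from cutting a circular plasmid at the given positions."""
    cuts = sorted(set(cut_positions))
    if not cuts:
        return [plasmid_len]  # uncut circle
    fragments = []
    for i, cut in enumerate(cuts):
        nxt = cuts[(i + 1) % len(cuts)]
        # A single cut linearises the plasmid, giving one full-length fragment.
        fragments.append((nxt - cut) % plasmid_len or plasmid_len)
    return fragments


if __name__ == "__main__":
    # Hypothetical coordinates (bp) on a hypothetical 12,000 bp plasmid.
    XBAI = 5000            # XbaI site at the end of the cassette
    BGLII_DOWNSTREAM = 5030  # BglII site wanted for cloning
    BGLII_IN_ANGB = 3500     # internal BglII site inside angB

    complete = circular_fragments(12000, [XBAI, BGLII_DOWNSTREAM, BGLII_IN_ANGB])
    partial = circular_fragments(12000, [XBAI, BGLII_DOWNSTREAM])
    print("complete digest:", sorted(complete))
    print("internal BglII site left uncut:", sorted(partial))
```

Under these assumed coordinates a complete digest splits the cassette at the internal site, whereas the partially digested molecule that is cut only at XbaI and the downstream BglII site retains the intact vector fragment.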

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A using S. erythraea Q42/1pSG1448/27/91/44/193/6eryCIII no9
(pSG144angAIangAIIangMIIIangBangMIeryCIII)

The S. erythraea strain Q42/1pSG1448/27/91/44/193/6eryCIII was grown and bioconversions
with fed 3-O-mycarosyl erythronolide B were performed as described in the General
Methods. The cultures were analysed and a small amount of a compound with m/z 750 was
detected, consistent with the presence of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin
A.

Isolation of plasmid pSG1448/27/95/2 (pSG144angAIangAIIorf14)

Plasmid pSGLit25/2 (isolated from E. coli ET12567) was digested with XbaI and the about 1
kb fragment was isolated and ligated with the XbaI digested vector fragment of pSG1448/27/9
(pSG144angAIangAII). The ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/95/2 (pSG144angAIangAIIorf14) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/95/21/4 (pSG144angAIangAIIorf14angMIII)

Plasmid pSGLit21/4 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/95/2 (pSG144angAIangAIIorf14). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/95/21/4 (pSG144angAIangAIIorf14angMIII) was isolated
using standard protocols. The construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/95/21/44/19 (pSG144angAIangAIIorf14angMIIIangB)

Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the 1.2 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/95/21/4 (pSG144angAIangAIIorf14angMIII). The ligation was used to transform
E. coli DH10B and plasmid pSG1448/27/95/21/44/19
(pSG144angAIangAIIorf14angMIIIangB) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/95/21/44/193/6eryCIII
(pSG144angAIangAIIorf14angMIIIangBangMIeryCIII)

Plasmid pSG1448/27/91/44/193/6eryCIII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/95/21/44/19 (pSG144angAIangAIIorf14angMIIIangB). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/95/21/44/193/6eryCIII
(pSG144angAIangAIIorf14angMIIIangBangMIeryCIII) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis.
EryCIII carries a his-tag fusion at the end. The construct was used to transform S. erythraea
SGQ2 using standard procedures.

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A

The S. erythraea strain SGQ2pSG1448/27/95/21/44/193/6eryCIII was grown and
bioconversions with fed 3-O-mycarosyl erythronolide B were performed as described in the
General Methods. The cultures were analysed and improved amounts of a compound with m/z
750 were detected, consistent with the presence of 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A. Similar results were obtained with the S. erythraea strain Q42/1 containing
the gene cassette pSG1448/27/95/21/44/193/6eryCIII.

16 mg of the compound with m/z 750 was purified and the structure of 5-O-dedesosaminyl-5-
O-mycaminosyl erythromycin A was confirmed by NMR analysis (see Table I and Figure 1).

Table II: 1H and 13C NMR data for 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A
Position  δH    Multiplicity  Coupling         δC
                                               80.0
4         2.00  m                              39.1
5         3.53  d             6.8              85.4
6                                              74.8
7         1.66  dd            14.8, 2.2        38.5
          1.82  dd            14.8, 11.4
8         2.69  dqd           11.3, 7.0, 2.2   44.9
9                                              221.6
10        3.06  qd            6.9, 1.3         38.0
11        3.81  d             1.3              68.9
12                                             74.6
13        5.04  dd            11.0, 2.3        76.8 a
14        1.47  dqd           14.3, 11.0, 7.2  21.1
          1.91  ddq           14.3, 7.5, 2.2
15        0.83  dd            7.4, 7.4         10.6
16        1.18  d             7.1              16.0
17        1.03  d             7.4              9.7
18        1.44  s                              26.6
Position  δH    Multiplicity  Coupling         δC


19 


1 16 


A 
u 


7 0 


1 O.J> 


20 


1 14 


A 
u 


7 n 


1 7 n 

1Z.U 


91 


1 19 
i . iz 


c 

s 




1 £ 7 
J. O.Z 


1 ' 
1 


A R7 


A 
U 


A ft 


OA A 


9' 




qq 


1^7 AC 


1A O 






oa 


1^7 no 




J 








77 Q 
/Z.o 


4' 
*t 


J.Ul 


A 

a 




77 ft 
/ /.O 


.> 


1 QQ * 


dq 


O 1 £ 7 

y.j, o.z 


OJ.O 


o 


1 77 *•' 
I.Z / 


a 


£ 7 

o.z 


1 ft ^ 


7' 
/ 


1 77 
l.ZJ 


s 




7 1 A 


ft' 
o 


1 70 


s 




AO A 


1 


A A*\ 


A 

a 


7 A


2"        3.56  dd            10.5, 7.3        71.3
3"        2.48  dd            10.3, 10.3       70.6
4"        3.09  dd            9.9, 9.0         70.2
5"        3.31  dq            9.0, 6.1         72.9
6"        1.29  d             6.1              18.1
7"        2.58  s                              41.7



a This carbon was assigned from the HMQC spectrum



Example 4: Isolation of mycaminosyl tylactone 

Isolation of plasmid pSG1448/27/95/21/44/193/6tylMII
(pSG144angAIangAIIorf14angMIIIangBangMItylMII)

Plasmid pSG1448/27/91/44/193/6tylMII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/95/21/44/19 (pSG144angAIangAIIorf14angMIIIangB). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/95/21/44/193/6tylMII
(pSG144angAIangAIIorf14angMIIIangBangMItylMII) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis.
TylMII carries a his-tag fusion at the end.

Bioconversion of tylactone to mycaminosyl tylactone

The S. erythraea strain Q42/1pSG1448/27/95/21/44/193/6tylMII is grown and bioconversions
with fed tylactone are performed as described in the General Methods. The cultures are
analysed and a compound with m/z 568 is detected, consistent with the presence of
mycaminosyl tylactone.

Example 5: Isolation of 5-O-dedesosaminyl-5-O-angolosaminyl erythromycins using
gene cassette pSG1448/27/91/4spnO5/2p4/193/6tylMII by bioconversion of 3-O-mycarosyl
erythronolide B.

Isolation of plasmid conv no1

For the multiple use of promoter sequences in act-controlled gene cassettes a 240 bp fragment
was amplified by PCR using the primers BIOSG78 5'-
GGGCATATGTGTCCTCCTTAATTAATCGATGCGTTCGTCC-3' and BIOSG79 5'-
GGAGATCTGGTCTAGATCGTGTTCCCCTCCCTGCCTCGTGGTCCCTCACGC-3' and
plasmid pSG142 (Gaisser et al., 2000) as template. The 0.2 kb PCR fragment (PCR no5) was
cloned using standard procedures and EcoRV digested plasmid Litmus28. Plasmid conv no1
was isolated. The construct was verified by sequence analysis.
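The text does not state why plasmids are passed through E. coli ET12567 before XbaI digestion. One common reason, offered here as an assumption rather than as the authors' rationale, is that XbaI does not cut when its site overlaps a dam-methylated GATC motif (GATCTAGA or TCTAGATC). The following Python sketch checks a sequence for such overlaps; the function name is hypothetical, and primer BIOSG79 from the paragraph above is used as the test sequence.

```python
XBAI_SITE = "TCTAGA"


def dam_overlapping_xbai_sites(seq):
    """Return start indices of XbaI sites that overlap a GATC motif,
    i.e. that occur as GATCTAGA or TCTAGATC (hypothetical helper)."""
    hits = []
    i = seq.find(XBAI_SITE)
    while i != -1:
        upstream_overlap = seq[max(i - 2, 0):i] == "GA"   # GA + TCTAGA -> GATCTAGA
        downstream_overlap = seq[i + 6:i + 8] == "TC"     # TCTAGA + TC -> TCTAGATC
        if upstream_overlap or downstream_overlap:
            hits.append(i)
        i = seq.find(XBAI_SITE, i + 1)
    return hits


if __name__ == "__main__":
    biosg79 = "GGAGATCTGGTCTAGATCGTGTTCCCCTCCCTGCCTCGTGGTCCCTCACGC"
    print(dam_overlapping_xbai_sites(biosg79))
```

For BIOSG79 the XbaI site is followed directly by TC, so the check reports an overlap at that position; a site such as GGTCTAGAGG would not be reported.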

Isolation of pSGLit3relig1

Plasmid conv no1 was digested with NdeI/BglII and the about 0.2 kb fragment was isolated
and ligated with the BamHI/NdeI digested vector fragment of pSGLit2. The ligation was used
to transform E. coli DH10B and plasmid pSGLit3relig1 was isolated using standard
procedures. This construct was verified using restriction digests and sequence analysis.

Isolation of plasmid pSGLit34/19

Plasmid Lit4/19 was digested with NdeI/XbaI and the 1.2 kb fragment was isolated and
ligated to NdeI/XbaI digested DNA of pSGLit3. The ligation was used to transform E. coli
ET12567 and plasmid pSGLit34/19 no23 was isolated. This construct was digested with XbaI
and the isolated 1.4 kb fragment was used for the assembly of gene cassettes.

Cloning of orf4 by isolating plasmid Lit6/4

The gene orf4 was amplified by PCR using the primers BIOSG75 5'-
GGGCATATGAGCACCCCTTCCGCACCACCCGTTCCG-3' and BIOSG76 5'-
GGTCTAGAGGTCAGTACAGCGTGTGGGCACACGCCACCAG-3' and cosmid4H2
containing a fragment of the angolamycin biosynthetic pathway was used as template. The 2.5
kb PCR fragment (PCR no6) was cloned using standard procedures and EcoRV digested
plasmid Litmus28. Plasmid Lit6/4 was isolated with an NdeI site overlapping the start codon
of orf4 and an XbaI site following the stop codon. The construct was verified by sequence
analysis.
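The primer design used throughout this section (an NdeI site overlapping the start codon in the forward primer, an XbaI site following the stop codon in the reverse primer) can be checked directly on the primer sequences. The Python sketch below does this for BIOSG75 and BIOSG76. The helper names are hypothetical, and the reading of the reverse primer assumes that the stop codon lies two bases upstream of the XbaI site on the coding strand, as the primer sequence suggests.

```python
NDEI = "CATATG"   # NdeI recognition site; its last three bases are ATG
XBAI = "TCTAGA"   # XbaI recognition site (palindromic)
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def primer_features(forward, reverse):
    """Locate the NdeI/start codon in the forward primer and the bases
    preceding the XbaI site on the coding strand of the reverse primer."""
    nde = forward.find(NDEI)
    coding_end = reverse_complement(reverse)
    xba = coding_end.find(XBAI)
    return {
        "ndei_index": nde,
        "start_codon": forward[nde + 3:nde + 6],
        "xbai_index": xba,
        # Five bases before XbaI: stop codon plus an assumed 2-base spacer.
        "before_xbai": coding_end[xba - 5:xba],
    }


if __name__ == "__main__":
    biosg75 = "GGGCATATGAGCACCCCTTCCGCACCACCCGTTCCG"
    biosg76 = "GGTCTAGAGGTCAGTACAGCGTGTGGGCACACGCCACCAG"
    print(primer_features(biosg75, biosg76))
```

For this primer pair the NdeI site supplies the ATG start codon, and the coding strand ends with TGA followed by two bases and the XbaI site.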

Isolation of plasmid pSGLit26/4 no9

Plasmid Lit6/4 was digested with NdeI/XbaI and the DNA was isolated and ligated to
NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli ET12567
and plasmid pSGLit26/4 no9 was isolated. This construct was confirmed by restriction digests
and sequence analysis.

Cloning of spnO by isolating plasmid pUC19spnO

The gene spnO from the spinosyn biosynthetic gene cluster of Saccharopolyspora spinosa
was amplified by PCR using the primers BIOSG41 5'-
GGGCATATGAGCAGTTCTGTCGAAGCTGAGGCAAGTG-3' and BIOSG42 5'-
GGTCTAGAGGTCATCGCCCCAACGCCCACAAGCTATGCAGG-3' and genomic DNA
of S. spinosa as template. The about 1.5 kb PCR fragment was cloned using standard
procedures and SmaI digested plasmid pUC19. Plasmid pUC19spnO no2 was isolated with an
NdeI site overlapping the start codon of spnO and an XbaI site following the stop codon. The
construct was verified by sequence analysis.

Isolation of plasmid pSGLit2spnO no4

Plasmid pUC19spnO was digested with NdeI/XbaI and the 1.5 kb fragment was isolated and
ligated to NdeI/XbaI digested DNA of pSGLit2. The ligation was used to transform E. coli
ET12567 and plasmid pSGLit2spnO no4 was isolated using standard procedures. This
construct was digested with XbaI and the isolated 1.5 kb fragment was used for the assembly
of gene cassettes.

Isolation of plasmid pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO)

Plasmid pSGLit2spnO no4 (isolated from E. coli ET12567) was digested with XbaI and the
1.5 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4 (pSG144angAIangAIIangMIII). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO) was isolated
using standard protocols. The construct was verified with restriction digests and sequence
analysis.

Isolation of plasmid pSG1448/27/91/4spnO5/2 (pSG144angAIangAIIangMIIIspnOangorf14)

Plasmid pSGLit25/2 no24 (isolated from E. coli ET12567) was digested with XbaI and the 1
kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/4spnO5/2
(pSG144angAIangAIIangMIIIspnOangorf14) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnO5/2p4/19
(pSG144angAIangAIIangMIIIspnOangorf14pangB)

Plasmid pSGLit34/19 no23 (isolated from E. coli ET12567) was digested with XbaI and the
about 1.4 kb fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnO5/2 (pSG144angAIangAIIangMIIIspnOangorf14). The ligation was
used to transform E. coli DH10B and plasmid pSG1448/27/91/4spnO5/2p4/19
(pSG144angAIangAIIangMIIIspnOangorf14pangB) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis. 'p' indicates the
presence of the promoter region in front of angB to emphasize the presence of multiple
promoter sites in the construct.

Isolation of plasmid pSG1448/27/91/4spnO5/2p4/193/6eryCIII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMIeryCIII)

Plasmid pSG1448/27/91/44/193/6eryCIII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/91/4spnO5/2p4/19 (pSG144angAIangAIIangMIIIspnOorf14pangB). The
ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/91/4spnO5/2p4/193/6eryCIII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMIeryCIII) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis. EryCIII
carries a his-tag fusion at the end. 'p' indicates the presence of the promoter region in front of
angB to emphasize the presence of multiple promoter sites in the construct. The plasmid
construct was used to transform mutant strains of S. erythraea using standard procedures.

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-angolosaminyl
erythromycins

Strain S. erythraea Q42/1pSG1448/27/91/4spnO5/2p4/193/6eryCIII was grown and
bioconversions with fed 3-O-mycarosyl erythronolide B were performed as described in the
General Methods. The cultures were analysed and peaks with m/z 704, m/z 718 and m/z 734,
consistent with the presence of angolosaminyl erythromycin D, B and A, respectively, were
observed.
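The observed m/z values can be compared with calculated [M+H]+ ions. The Python sketch below assumes that the peaks are protonated molecules and that angolosamine is isomeric with desosamine (both C8H17NO3), so that the angolosaminyl analogues share the elemental formulas of erythromycins D, B and A. These formulas are assumptions introduced for illustration and are not given in the text.

```python
# Monoisotopic masses (u) of the most abundant isotopes.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727646


def mh_plus(formula):
    """Monoisotopic m/z of the singly protonated molecule [M+H]+.

    `formula` maps element symbols to atom counts."""
    neutral = sum(MONOISOTOPIC[el] * n for el, n in formula.items())
    return neutral + PROTON


# Assumed elemental formulas (same as erythromycins D, B and A).
ANALOGUES = {
    "angolosaminyl erythromycin D": {"C": 36, "H": 65, "N": 1, "O": 12},
    "angolosaminyl erythromycin B": {"C": 37, "H": 67, "N": 1, "O": 12},
    "angolosaminyl erythromycin A": {"C": 37, "H": 67, "N": 1, "O": 13},
}

if __name__ == "__main__":
    for name, formula in ANALOGUES.items():
        print(f"{name}: [M+H]+ = {mh_plus(formula):.3f}")
```

Under these assumptions the calculated ions round to the nominal values 704, 718 and 734 reported above.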

Example 6: Production of 5-O-angolosaminyl tylactone

Isolation of plasmid pSG1448/27/91/4spnO5/2p4/193/6tylMII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMItylMII)

Plasmid pSG1448/27/91/44/193/6tylMII no9 was digested with BglII and the about 2 kb
fragment was isolated and ligated with the BglII digested vector fragment of
pSG1448/27/91/4spnO5/2p4/19 (pSG144angAIangAIIangMIIIspnOorf14pangB). The
ligation was used to transform E. coli DH10B and plasmid
pSG1448/27/91/4spnO5/2p4/193/6tylMII
(pSG144angAIangAIIangMIIIspnOorf14pangBangMItylMII) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis. TylMII
carries a his-tag fusion at the end. The plasmid was used to transform mutant strains of S.
erythraea applying standard protocols. 'p' indicates the presence of the promoter region in
front of angB to emphasize the presence of multiple promoter sites in the construct.

Isolation of S. erythraea 18A1 (BIOT-2634)

To introduce a deletion comprising the PKS and the majority of the post-PKS genes in S. erythraea, a
region of the left hand side of the ery-cluster (LHS) containing a portion of eryCI, the
complete ermE gene and a fragment of the eryBI gene was cloned together with a region of
the right hand side of the ery-cluster (RHS) containing a portion of the eryBVII gene, the
complete eryK gene and a fragment of DNA adjacent to eryK. This construct should enable
homologous recombination into the genome in both LHS and RHS regions, resulting in the
isolation of a strain containing a deletion between these two regions of DNA. The LHS
fragment (2201 bp) was PCR amplified using S. erythraea chromosomal DNA as template
and primers BIdelNde (5'-CCCATATGACCGGAGTTCGAGGTACGCGGCTTG-3') and
BIdelSpe (5'-GATACTAGTCCGCCGACCGCACGTCGCTGAGCC-3'). Primer BIdelNde
contains an NdeI restriction site (CATATG) and primer BIdelSpe contains a SpeI restriction
site (ACTAGT) used for subsequent cloning steps. The PCR product was cloned into the SmaI
restriction site of pUC19, and plasmid pLSB177 was isolated using standard procedures. The construct
was confirmed by sequence analysis. Similarly, RHS (2158 bp) was amplified by PCR using
S. erythraea chromosomal DNA as template and primers BVIIdelSpe (5'-
TGCACTAGTGGCCGGGCGCTCGACGTCATCGTCGACAT-3') and BVIIdelEco (5'-
TCGATATCGTGTCCTGCGGTTTCACCTGCAACGCTG-3'). Primer BVIIdelSpe contains
a SpeI restriction site and primer BVIIdelEco contains an EcoRV restriction site. The PCR
product was cloned into the SmaI restriction site of pUC19 in the orientation with SpeI
positioned adjacent to KpnI and EcoRV positioned adjacent to XbaI. The plasmid pLSB178
was isolated and confirmed using sequence analysis. Plasmid pLSB177 was digested with
NdeI and SpeI and the ~2.2 kb fragment was isolated; similarly, plasmid pLSB178 was
digested with NdeI and SpeI and the ~4.6 kb fragment was isolated using standard methods.
Both fragments were ligated and plasmid pLSB188, containing LHS and RHS combined
together at a SpeI site in pUC19, was isolated using standard protocols. An NdeI/XbaI
fragment (~4.4 kbp) from pLSB188 was isolated and ligated with SpeI and NdeI treated
pCJR24. The ligation was used to transform E. coli DH10B and plasmid pLSB189 was
isolated using standard methods. Plasmid pLSB189 was used to transform S. erythraea P2338
and transformants were selected using thiostrepton. S. erythraea Del18 was isolated and
inoculated into 6 ml TSB medium and grown for 2 days. A 5% inoculum was used to
subculture this strain 3 times. 100 µl of the final culture were plated onto R2T20 agar
followed by an incubation at 30°C to allow sporulation. Spores were harvested, filtered,
diluted and plated onto R2T20 agar using standard procedures. Colonies were replica plated
onto R2T20 plates with and without addition of thiostrepton. Colonies that could no longer
grow on thiostrepton were selected and further grown in TSB medium. S. erythraea 18A1
was isolated and confirmed using PCR and Southern blot analysis. The strain was designated
LB-1/BIOT-2634. For further analysis, the production of erythromycin was assessed as
described in General Methods and the lack of erythromycin production was confirmed. In
bioconversion assays this strain did not further process fed erythronolide B, and erythromycin
D was hydroxylated at C-12 to give erythromycin C as expected, indicating that EryK was still
functional.

Bioconversion of tylactone to 5-O-angolosaminyl tylactone

Strain S. erythraea SGQ2pSG1448/27/91/4spnO5/2p4/193/6tylMII was grown and
bioconversions with fed tylactone were performed as described in the General Methods. The
cultures were extracted and analysed. A compound consistent with the presence of
angolosaminyl tylactone was detected. 20 mg of this compound were purified and the
structure was confirmed by NMR analysis. A compound consistent with the presence of
angolosaminyl tylactone was also obtained when the gene cassette
pSG1448/27/91/4spnO5/2p4/193/6tylMII was expressed in the S. erythraea strain Q42/1 or S.
erythraea 18A1.

Table III: NMR data for 5-O-β-D-angolosaminyl tylactone
#    δC     δH (mult., Hz)            COSY H-H         HMBC H-C
1    174.4
2    39.8   1.91 d (16.8)             2b               1, 3
            2.46 dd (16.8, 10.5)      2a, 3            1
3    66.9   3.68 dd (10.5, 1.2)       2b               1
4    40.4   1.56 m                    5, 18            3
5    80.7   3.76 d (10.3)             4                4, 7, 18, 19, 1'
6    38.7   2.68 m                    7b
7    33.6   1.45 m                    6
            1.55 m
8    45.0   2.70 m                    21
9    203.9
10   118.3  6.26 d (15.5)             11               12
11   147.7  7.27 d (15.5)             10               9, 12, 13, 22
12   133.5
13   145.4  5.60 d (10.4)             14, 22           11, 14, 22, 23
14   38.3   2.70 m                    13, 15, 23       12, 13, 15, 23
15   78.8   4.68 td (9.7, 2.4)        14, 16b          1, 17
16   24.7   1.55 m                    15, 16b, 17      15
            1.82 ddd                  16a, 17          18
17   9.6    0.91 t (7.2)              16               15, 16
18   9.7    0.91 d (7.2)              4                3, 4, 5
19   21.0   1.55 m                    20
20   11.8   0.83 t (7.2)              19               6, 19
21   17.1   1.15 d (6.8)              8                7, 9
22   13.0   1.76 s                    13               11, 12, 13
23   16.1   1.05 d (6.5)              14               13, 14, 15
1'   101.0  4.41 d (8.6)              2'               2'
2'   28.0   1.48 m                    1', 2b', 3'      1', 3', 4'
            2.05 ddd (10.4, 3.9, 1.6) 2a', 3'          1', 3'
3'   65.8   2.89 td (10.0, 3.9)       2a', 2b', 4'     4'
4'   70.5   3.16 dd (9.5, 9.0)        3', 5'           3', 5', 6'
5'   73.2   3.26 dq (9.6, 6.0)        4', 6'
6'   17.7   1.3 d (6.0)               5'



Isolation of plasmid pSG1448/27/91/4spnOp5/2
(pSG144angAIangAIIangMIIIspnOpangorf14)

Plasmid pSGLit35/2 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnO (pSG144angAIangAIIangMIIIspnO). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/2
(pSG144angAIangAIIangMIIIspnOpangorf14) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnOp5/24/19
(pSG144angAIangAIIangMIIIspnOpangorf14angB)

Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnOp5/2 (pSG144angAIangAIIangMIIIspnOpangorf14). The ligation was
used to transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/24/19
(pSG144angAIangAIIangMIIIspnOpangorf14angB) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnOp5/24/193/6
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMI)

Plasmid pSGLit23/6 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/91/4spnOp5/24/19 (pSG144angAIangAIIangMIIIspnOpangorf14angB). The
ligation was used to transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/24/193/6
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMI) was isolated using standard
protocols. The construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/91/4spnOp5/24/193/6angMII
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMIangMII)

Plasmid pSGLit1angMII (isolated from E. coli ET12567) was digested with XbaI/BglII and
the insert fragment was isolated and ligated with the XbaI and partial BglII digested vector
fragment of pSG1448/27/91/4spnOp5/24/193/6
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMI). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/91/4spnOp5/24/193/6angMII
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMIangMII) was isolated using
standard protocols. The construct was verified with restriction digests and sequence analysis.
The plasmid was used to transform mutant strains of S. erythraea with standard procedures.

Biotransformation using S. erythraea Q42/1pSG1448/27/91/4spnOp5/24/193/6angMII
(pSG144angAIangAIIangMIIIspnOpangorf14angBangMIangMII)

Biotransformation experiments feeding tylactone are carried out as described in General
Methods and the cultures are analysed. Angolosaminyl tylactone is detected.

Isolation of plasmid pSG1448/27/96/4 (pSG144angAIangAIIangorf4)

Plasmid pSG1448/27/9 (pSG144angAIangAII) was digested with XbaI and treated with
alkaline phosphatase using standard protocols. The vector fragment was used for ligations
with XbaI treated plasmid pSGLit26/4 no9 followed by transformations of E. coli DH10B
using standard protocols. Plasmid pSG1448/27/96/4 (pSG144angAIangAIIangorf4) was
isolated using standard procedures and the construct was confirmed by restriction digests and
sequence analysis.

Isolation of plasmid pSG1448/27/96/4p5/2 (pSG144angAIangAIIangorf4pangorf14)

Plasmid pSGLit35/2 (isolated from E. coli ET12567) was digested with XbaI and the insert
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/96/4 (pSG144angAIangAIIangorf4). The ligation was used to transform E. coli
DH10B and plasmid pSG1448/27/96/4p5/2 (pSG144angAIangAIIangorf4pangorf14) was
isolated using standard protocols. The construct was verified with restriction digests and
sequence analysis.

Isolation of plasmid pSG1448/27/96/4p5/21/4
(pSG144angAIangAIIangorf4pangorf14angMIII)

Plasmid pSGLit21/4 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/96/4p5/2 (pSG144angAIangAIIangorf4pangorf14). The ligation was used to
transform E. coli DH10B and plasmid pSG1448/27/96/4p5/21/4
(pSG144angAIangAIIangorf4pangorf14angMIII) was isolated using standard protocols. The
construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/96/4p5/21/44/19
(pSG144angAIangAIIangorf4pangorf14angMIIIangB)

Plasmid pSGLit24/19 (isolated from E. coli ET12567) was digested with XbaI and the 1.4 kb
fragment was isolated and ligated with the XbaI digested vector fragment of
pSG1448/27/96/4p5/21/4 (pSG144angAIangAIIangorf4pangorf14angMIII). The ligation was
used to transform E. coli DH10B and plasmid pSG1448/27/96/4p5/21/44/19
(pSG144angAIangAIIangorf4pangorf14angMIIIangB) was isolated using standard protocols.
The construct was verified with restriction digests and sequence analysis.

Isolation of plasmid pSG1448/27/96/4p5/21/44/193/6angMII
(pSG144angAIangAIIangorf4pangorf14angMIIIangBangMIangMII)

Plasmid pSG1448/27/91/4spnOp5/24/193/6angMII was digested with BglII and the fragment of
about 2.2 kb was isolated and ligated with the BglII treated vector fragment of
pSG1448/27/96/4p5/21/44/19. The ligation was used to transform E. coli DH10B using
standard procedures and plasmid pSG1448/27/96/4p5/21/44/193/6angMII
(pSG144angAIangAIIangorf4pangorf14angMIIIangBangMIangMII) was isolated. The
construct was verified using restriction digests and sequence analysis. The plasmid was used
to transform mutant strains of S. erythraea with standard protocols.
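By way of illustration only, the descriptive names given in parentheses for these constructs can be read as the backbone name followed by the gene fragments in the order in which they were inserted. The short Python sketch below reproduces that naming; the function name, the list representation of the cassette and the gene spellings (in particular angorf14 and angMI) are assumptions made for the illustration and do not form part of the described method.

```python
def descriptive_name(backbone, genes):
    """Concatenate the backbone name and the gene names in order of insertion."""
    return backbone + "".join(genes)

# Starting cassette and the fragments added in the three subsequent cloning
# steps, as read from the descriptive names in this Example (assumed spellings).
cassette = ["angAI", "angAII", "angorf4", "pangorf14"]
steps = [["angMIII"], ["angB"], ["angMI", "angMII"]]

names = [descriptive_name("pSG144", cassette)]
for inserted in steps:
    cassette = cassette + inserted
    names.append(descriptive_name("pSG144", cassette))

for name in names:
    print(name)
```

Each printed name corresponds to one construct in the series, the last being the cassette used to transform S. erythraea.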

Bioconversion of tylactone with S. erythraea Q42/1pSG1448/27/96/4p5/21/44/193/6angMII
(pSG144angAIangAIIangorf4pangorf14angMIIIangBangMIangMII)
Biotransformation experiments feeding tylactone are carried out as described in General
Methods and the cultures are analysed. Angolosaminyl tylactone is detected.

Example 7: Cloning of eryK into the gene cassette pSG144 

Isolation of plasmid pUC19eryK
To amplify eryK, primers eryK1 5'-GGTCTAGACTACGCCGACTGCCTCGGCGAGGAGCCC-3' and eryK2
5'-GGCATATGTTCGCCGACGTGGAAACGACCTGCTGCG-3' were used and the PCR
product was cloned as described for pUC19eryCVI. Plasmid pUC19eryK was isolated.
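The two primers carry the recognition sites that are used in the subsequent NdeI/XbaI digests. The following Python sketch, given by way of example only, locates those sites in the primer sequences; the dictionary and function names are illustrative assumptions, and the recognition sequences are the generally known sites for XbaI (TCTAGA) and NdeI (CATATG).

```python
# Recognition sequences of the enzymes used for cloning eryK.
SITES = {"XbaI": "TCTAGA", "NdeI": "CATATG"}

# Primer sequences as given in the text (5' to 3'), with spaces removed.
PRIMERS = {
    "eryK1": "GGTCTAGACTACGCCGACTGCCTCGGCGAGGAGCCC",
    "eryK2": "GGCATATGTTCGCCGACGTGGAAACGACCTGCTGCG",
}

def find_sites(seq, sites=SITES):
    """Return {enzyme: 0-based position} for every site present in seq."""
    return {name: seq.find(site) for name, site in sites.items() if site in seq}

for name, seq in PRIMERS.items():
    print(name, find_sites(seq))
```

In this reading, eryK1 carries the XbaI site and eryK2 carries the NdeI site, each preceded by two additional nucleotides at the 5' end.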

Isolation of plasmid pLSB111 (pCJR24eryK)

Plasmid pUC19eryK was digested with NdeI/XbaI and the insert band was ligated with
NdeI/XbaI digested pCJR24. Plasmid pLSB111 (pCJR24eryK) was isolated and the construct
was verified with restriction digests.

Isolation of plasmid pLSB115

Plasmid pLSB111 (pCJR24eryK) was digested with NdeI/XbaI and the insert fragment was
isolated and ligated with the NdeI/XbaI digested vector fragment of plasmid pSGLit2, and
plasmid pLSB115 was isolated using standard protocols. The plasmid was verified using
restriction digestion and DNA sequence analysis.

Isolation of plasmid pSG1448/27/95/21/4eryK

Plasmid pLSB115 from E. coli ET12567 was digested with XbaI and the insert fragment was
isolated and ligated with the XbaI treated vector fragment of pSG1448/27/95/21/4
(pSG144angAIangAIIangorf14angMIII). The ligation was used to transform E. coli DH10B
with standard procedures and plasmid pSG1448/27/95/21/4eryK
(pSG144angAIangAIIangorf14angMIIIeryK) is isolated. The construct is confirmed with
restriction digests.

Isolation of plasmid pSG1448/27/95/21/4eryK4/19 

Plasmid pSGLit24/19 from E. coli ET12567 is digested with XbaI and the insert fragment is
isolated and ligated with the XbaI treated vector fragment of plasmid
pSG1448/27/95/21/4eryK. The ligation is used to transform E. coli DH10B with standard
procedures and plasmid pSG1448/27/95/21/4eryK4/19
(pSG144angAIangAIIangorf14angMIIIeryKangB) is isolated. The construct is confirmed
with restriction digests.

Isolation of plasmid pSG1448/27/95/21/4eryK4/193/6eryCIII

Plasmid pSG1448/27/95/21/44/193/6eryCIII is digested with BglII and the fragment of about
2.1 kb is isolated and ligated with the BglII treated vector fragment of
pSG1448/27/95/21/4eryK4/19. Plasmid pSG1448/27/95/21/4eryK4/193/6eryCIII is isolated
using standard procedures and the construct is confirmed using restriction digests. The
plasmid is used to transform mutant strains of S. erythraea with standard methods.

Bioconversion of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl
erythromycin A

The S. erythraea strain Q42/1pSG1448/27/95/21/4eryK4/193/6eryCIII is grown and
bioconversions with fed 3-O-mycarosyl erythronolide B are performed as described in the
General Methods. The cultures are analysed and a compound with m/z 750 is detected,
consistent with the presence of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A.
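The observed m/z value can be compared with the value expected for the protonated molecule. The Python sketch below is provided for illustration only and assumes the elemental composition C37H67NO14, i.e. that of erythromycin A (C37H67NO13) plus one oxygen atom, since mycaminose carries a 4-hydroxyl group that desosamine lacks; the function and variable names are likewise illustrative.

```python
# Monoisotopic atomic masses (u) and the mass of a proton.
MASS = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727646

def mz_protonated(formula):
    """Return m/z of the singly protonated ion [M+H]+ for an element-count dict."""
    neutral = sum(MASS[element] * count for element, count in formula.items())
    return neutral + PROTON

# Assumed composition: erythromycin A (C37H67NO13) plus one oxygen atom.
mycaminosyl_erythromycin_a = {"C": 37, "H": 67, "N": 1, "O": 14}

print(round(mz_protonated(mycaminosyl_erythromycin_a), 2))
```

Under this assumption the calculated value rounds to the nominal m/z of 750 reported above.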

Example 8: Production of 13-desethyl-13-methyl-5-O-mycaminosyl erythromycins A
and B; 13-desethyl-13-isopropyl-5-O-mycaminosyl erythromycin A and B;
13-desethyl-13-secbutyl-5-O-mycaminosyl erythromycin A and B

Production of 13-desethyl-13-methyl-3-O-mycarosyl erythronolide B,
13-desethyl-13-isopropyl-3-O-mycarosyl erythronolide B and
13-desethyl-13-secbutyl-3-O-mycarosyl erythronolide B

Plasmid pLS025 (WO 03/033699), a pCJR24-based plasmid containing the DEBS1, DEBS2
and DEBS3 genes, in which the loading module of DEBS1 has been replaced by the loading
module of the avermectin biosynthetic cluster, was used to transform S. erythraea
JC2ΔeryCIII (isolated using techniques and plasmids described previously (Rowe et al., 1998;
Gaisser et al., 2000)) using standard techniques. The transformant JC2ΔeryCIIIpLS025 was
isolated and cultures were grown using standard protocols. Cultures of S. erythraea
JC2ΔeryCIIIpLS025 are extracted using methods described in the General Methods section
and the presence of 3-O-mycarosyl erythronolide B, 13-desethyl-13-methyl-3-O-mycarosyl
erythronolide B, 13-desethyl-13-isopropyl-3-O-mycarosyl erythronolide B and
13-desethyl-13-secbutyl-3-O-mycarosyl erythronolide B in the crude extract is verified by
LCMS analysis.

Production of 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A
and B, 13-desethyl-13-isopropyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and B,
13-desethyl-13-secbutyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and B
Cultures of S. erythraea JC2ΔeryCIIIpLS025 are extracted using methods described in the
General Methods section and the crude extracts are dissolved in 5 ml of methanol and
subsequently fed to culture supernatants of the S. erythraea strain
SGQ2pSG1448/27/95/21/44/193/6eryCIII using standard techniques. The bioconversion of
13-desethyl-13-methyl-3-O-mycarosyl erythronolide B, 13-desethyl-13-isopropyl-3-O-mycarosyl
erythronolide B and 13-desethyl-13-secbutyl-3-O-mycarosyl erythronolide B to
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B;
13-desethyl-13-isopropyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-isopropyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B;
13-desethyl-13-secbutyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-secbutyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B is verified by
LCMS analysis.

Example 9: 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin
A and 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B

Production of 13-desethyl-13-methyl-3-O-mycarosyl erythronolide B
Plasmid pIB023 (Patent application no 0125043.0), a pCJR24-based plasmid containing the
DEBS1, DEBS2 and DEBS3 genes, was used to transform S. erythraea JC2ΔeryCIII using
standard techniques. The transformant JC2ΔeryCIIIpIB023 was isolated and cultures were
grown using standard protocols, extracted and the crude extract was assayed using methods
described in the General Methods section. The production of 3-O-mycarosyl erythronolide B
and 13-desethyl-13-methyl-3-O-mycarosyl erythronolide B is verified by LCMS analysis.

Production of 13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A,
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B
Cultures of S. erythraea JC2ΔeryCIIIpIB023 are extracted using methods described in the
General Methods section and the crude extracts are dissolved in 5 ml of methanol and
subsequently fed to culture supernatants of S. erythraea
SGQ2pSG1448/27/95/21/44/193/6eryCIII using standard techniques. The bioconversion of
13-desethyl-13-methyl-3-O-mycarosyl erythronolide B to
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin A and
13-desethyl-13-methyl-5-O-dedesosaminyl-5-O-mycaminosyl erythromycin B is verified by
LCMS analysis.

Example 10: Production of 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin

Azithromycin aglycones were prepared using methods described in EP 1024145 A2 (Pfizer
Products Inc., Groton, Connecticut). The S. erythraea strain SGT2pSG142 was isolated using
techniques and plasmid constructs described earlier (Gaisser et al., 2000). Feeding
experiments are carried out using methods described previously (Gaisser et al., 2000) with the
S. erythraea mutant SGT2pSG142, thus converting azithromycin aglycone to 3-O-mycarosyl
azithronolide. Biotransformation experiments are carried out using S. erythraea
SGQ2pSG1448/27/95/21/44/193/6eryCIII and crude extracts containing 3-O-mycarosyl
azithronolide are added using standard microbiological techniques. The bioconversion of
3-O-mycarosyl azithronolide to 5-O-dedesosaminyl-5-O-mycaminosyl azithromycin is verified by
LCMS analysis.

Example 11: Production of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin C

Isolation of the S. erythraea mutant SGP1 (SGQ2ΔeryG)

To create a chromosomal deletion in eryG, construct pSGAG3 was isolated as follows:
Fragment 1 was amplified using primers BIOSG53
5'-GGAATTCGGCCAGGACGCGTGGCTGGTCACCGGCT-3' and BIOSG54
5'-GGTCTAGAAAGAGCGTGAGCAGGCTCTTCTACAGCCAGGTCA-3' and
genomic DNA of S. erythraea was used as template. Fragment 2 was amplified using primers
BIOSG55 5'-GGCATGCAGGAAGGAGAGAACCACGATGACCACCGACG-3' and
BIOSG56 5'-GGTCTAGACACCAGCCGTATCCTTTCTCGGTTCCTCTTGTG-3' and
genomic DNA of S. erythraea was used as template. Both DNA fragments were cloned into
SmaI cut pUC19 using standard techniques; plasmids pUCPCR1 and pUCPCR2 were isolated
and the sequence of the amplified fragments was verified. Plasmid pUCPCR1 was digested
using EcoRI/XbaI and the insert band DNA was isolated and cloned into EcoRI/XbaI digested
pUC19. Plasmid pSGAG1 is isolated using standard methods and digested with SphI/XbaI
followed by a ligation with the SphI/XbaI digested insert fragment of pUCPCR2. Plasmid
pSGAG2 is isolated using standard procedures, digested with SphI/HindIII and ligated with
the SphI/HindIII fragment of pCJR24 (Rowe et al., 1998) containing the gene encoding
thiostrepton resistance. Plasmid pSGAG3 is isolated and used to delete eryG in the genome of
S. erythraea strain SGQ2 using methods described previously (Gaisser et al., 1997; Gaisser et
al., 1998) and the S. erythraea mutant SGP1 (SGQ2ΔeryG) is created.

Production of 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin C
The S. erythraea strain SGP1 (S. erythraea SGQ2ΔeryG) is isolated using standard
techniques and subsequently transformed with the cassette construct
pSG1448/27/95/21/44/193/6eryCIII as described previously. The S. erythraea strain
SGP1pSG1448/27/95/21/44/193/6eryCIII is isolated and used for biotransformation as
described in Example 2 and assays are carried out as described above to verify the conversion
of 3-O-mycarosyl erythronolide B to 5-O-dedesosaminyl-5-O-mycaminosyl erythromycin C
by LCMS analysis.

Figure 1A




5-O-dedesosaminyl-5-O-mycaminosyl-erythromycin B
R1 = C2H5; R2 = R4 = R5 = R6 = R7 = R9 = -CH3; R3 = -H; R8 =

5-O-dedesosaminyl-5-O-mycaminosyl-erythromycin A
R1 = C2H5; R2 = R4 = R5 = R6 = R7 = R9 = -CH3; R3 = -OH; R8

5-O-dedesosaminyl-5-O-mycaminosyl-erythromycin C
R1 = C2H5; R2 = R4 = R5 = R6 = R7 = R9 = -CH3; R3 = -OH; R8

5-O-dedesosaminyl-5-O-mycaminosyl-azithromycin
R1 = C2H5; R2 = R4 = R5 = R6 = R7 = R9 = -CH3; R3 = -OH; R8
CH3

Figure 7

1 GGCATGCCTT CGGGGTGTGC GGCGGCGCCT CAGAGCGTGG CCAGTACCTC 
51 GTGCAGGGCC GCGATCACCT TGTCCTGTAC GTCGGGCGCG AGCCCCGGGT 
101 ACATCGGCAG CGAGAAGATC TCGTCCGCCA GCCGCTCCGT CACCGGCAGC 
151 GAGCCCTTGG CGTACCCCAG GTGCGCGAAG CCCGTCATGG TGTGCACGGG 
201 CCACGGGTAA CTGATGTTGA GCGAGATCCC GTACGACTTG AGCGCCTCGA 
251 TGATGTCGTC CCGGCGCGGG TGGCGGACGA CGTACACGTA ATACACGTGG 
301 TCGTTGCCCT CGGTGACGGA CGGCAGCACC AGGCCGCCGG GGCCCGTCAG 
351 GTTCGCGAGT CCTTCGGCGT AACGCCGGGC GACCGCGCGC CGGCCCTCGA 
401 TGTAGCGGTC GAGGCGGGTG AGCTTGCGGC GCAGGATCTC CGCCTGCACC 
451 TCGTCGAGCC GGCTGTTGTG GCCGGGCGTC TGCACGACGT AGTACACGTC 
501 CTCCATGCCG TAGTAGCGCA GCCGGCGCAG CGCACGGTCG ACGTCCGCGT 
551 CGTCGGTCAG CACGGCCCCG CCGTCGCCGT ACGCACCGAG GACCTTCGTC 
601 GGGTAGAACG AGAAGGCGGC GGCGTCGCCC AGCGTGCCGG CCAGCTCGCC 
651 GTGGTGGCGG GCACCGTGCG CCTGGGCGCA GTCCTCCAGC ACCACCAGGC 
701 CGTGCTGCTC GGCCAGGGCG CGCAAGGGCG CCATGTCGAC GCACTGCCCG 
751 TACAGGTGCA CCGGCAGCAG GGCCTTCGTG CGCGGGGTGA TGACGTCCGC 
801 GACCTGGTCG GTGTCCATGA GGTGGTCCTC GGCGCGGACG TCGACGAAGA 
851 CGGGCGTGGC ACCGGTGCCG TCGATGGCCA CCACCGTCGG CGCGGCCGTG 
901 TTGGAGACGG TGACGACCTC GTCCCCCGGG CCCACCCCGA GCGCCTGCAG 
951 ACCCAGCTTG ACGGCGTTGG TGCCGTTGTC GACACCGCCG CAGTGGCGCA 
1001 GGCCGTGGTA GTCCGCGAAC TCQTTCTCGA ACCCGTCCAC GCTGGGGCCG 
1051 AGGACCAACT GCCCGGAGGC GAAGACGGTC TCGACGGCGT CGAGGAGGTC 
1101 CGCGCGTTCG TTCTGGTATT CCGCCAGGTA GTCCCAGACG TAGGTAGTCA 
1151 CGGAGAGCTC AACCTCCAGA GTGTTTCGAT GGGGTGGTGG GAAGCCGGTG 
1201 CGCGCGGACC AGGTCGTGCC AGCAGTCGCG GACCGACTCC CGCAGCGAAC 
1251 GGCGCGGTGC CCAGCCCAGC AGGGCGCGCG CCGCGCCGGT GTCGACCCGC 
1301 AGCCAGTCCT CCCGGTGCCC GGGAGCCCGG CCCGGAGCCG GGCGCTCCAC
1351 CACCCGCGCC GGAATGCCGC TCGCCTCGAT GAACAGGCCG ACCAGGTCGC
1401 GGACGGCGAC CGCCTCGCCC CGCCCGATGC CGACGGCGAC CGGGACGGCC 
1451 GGTGCGCGGG CGGCGGCCAC GACGGCGTCG GCCACGTCCC GCACATCGAC 
1501 GTAGTCCCGG TGCGCGCGCA GCCGGGACAG TTCCACGACG GCCTCCGCAC 
1551 CCGTCCCGGC GGCCGCCAGC AGCCGCTCGG CGACCTGGCC CAGCAGACTG 
1601 ATCCGCGGGG TGCCGGGGCC CGACACGTTG GACACCCGTA GCACCACACC 
1651 GTCGACCCAC CCGCCCGAGG TGCCCCGCAG CACCGCCTCG CTGGCGGCGA 
1701 GCTTGCTCCT GCCGTACGCC GTGTCCGGGC GCGGTACGGC GTCGGCGCCC 
1751 ACCGAACCGC CGGGCGTCAC CGGGCCGTAC TCCAGTACCG AGCCGAGGTG 
1801 GACCAGCCGC GGCCGCGCGG ACATCAGCGC CAGCGCCTCC AGCAGGCGCA 
1851 GCGTGGGCAC CGCGGTGGCG GACCACATCT GCTCGTCGGT ACGGCCCCAG 
1901 ATGCTTCCGA CGGAGTTGAC GATCGTGTCC GGACGCTCCG CGTCCAGGGC 
1951 GGCGGCCAGC GCCGCGGGAT CCGTACCGGC CAGGTCCAGG GTGACGCAGC 
2001 GGTACGGCAT CGGCTCCTCG GGCGGGCGGC GGCCCACCAC CACCACGTCA 
2051 CGGCCCCGCG CGGCGAACGC CGCGCACACA TGCCGGCCGA CGTACCCGGC 
2101 GCCGCCCAGG ACCACGACGC TGCCACTGCC ACTGCCGCGC GGCATCGGAT 
2151 CGTTCACCAT 
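
The listing above ends in CAT, the reverse complement of an ATG start codon, which suggests that the corresponding open reading frame lies on the complementary strand. The Python sketch below is given by way of example only: it reverse-complements the last 30 nucleotides of the listing (positions 2131 to 2160) and translates them with the standard genetic code. The function names are illustrative, and the correspondence with the first ten residues shown in Figure 13 is an inference drawn from the match rather than a statement of the specification.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(seq):
    """Translate complete codons of seq in frame 1 (stop codons shown as '*')."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))

# Last 30 nucleotides of the listing, spaces removed.
tail = "ACTGCCGCGCGGCATCGGATCGTTCACCAT"

print(translate(reverse_complement(tail)))
```

The printed peptide can be compared directly with the first line of the amino acid sequence in Figure 13.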



Figure 8 

11301 CGTCAGTACA GCGTGTGGGC ACACGCCACC AGGGTGCGCA GCTCGATGTT 

11351 GAGGTAGTTG CCGTGCGCCA GCAGCCCGGT GAGCTGACCG AGCGACAGCC 

11401 AGGCGAAGTC GTCCGGTGCG TCCTCCGGGA AGTCGTGCGG GACCTCCACG 

11451 ATCACGTAGC GGTTCTGGGC GTGGAAGAAG CGCCCGCCCT CCTCGGACTG 

11501 GACGGCGTCG TAGCGCACGT CCTGAGGCGG CGCGGACAGC ACGTCCTCCA 

11551 GGTACGGCGG GCCGGGCAGC CCCCGCGGAC CGGTGTGCTC CTGTGGCCGG 

11601 CACTGGACCG TGGGGGCCAG CTCGGCGACG TTCAGGTGCC CGACGTCCAC 

11651 CCGTGCCCGC ACGAGCGCGT GCAGCACGCC GTCGACGGAC TTGACCAGCA 

11701 GCGCCATCAG ACCCGGCAGC CGCGGCTCGA TGAGCGGCTG CGTCCAGGAG

11751 GTGACCTCCC GGCTGCTGGC GCTGACCTCG GCGGCCATGA CCCGGAAGTG

11801 CCGCCCGCTC TCGTGGGCGA TCTCGTGCGG CGTGCGGTAC CAGCCGTCCG 

11851 CCGTCACCGT ATCGAGCGGC ACCCGGTTCT GCACCAGCTC CCGCAGGGCG 

11901 CGCACACCCG TGAACCACGT CAGGACCTCG GCCGTCGTGT GCCGCGCCGC 

11951 ACCCGGCGAG CCGAAGAAGG AGCGCAGCAC GGGGGACGGG GCGGACGCGT 

12001 CGGCGTCCGC CGTGGGCAGG CAGGCGAGGA TGGACCGGGC GTCCATGTTG 

12051 ACCACGTTGT CCAGCATCAG CAGCCGGCGG AGCTGCCCCA GCGTCAGCCA 

12101 GCGGAAGTCC TCCCCGATGT CGAGGTCGTC GTCCGCCGCC AACTCGACGA 

12151 TCATGTTCCG GTTGCGTTTG GCCAGGAACC AGTCCGCCTG TTCGGACTGG 

12201 ATCGAGTCGA CCAGGACACG CGCCCGTCGC GGCCCCATGA ACAGGTCCAG 

12251 ATAGCGGATG TCGCGCCCCC GGTGCACCCC GGTGAAGTTG CTCCGGGTGG 

12301 CCTGCACGGT CGGCGACACC TGAAGAACGT TGACGTTCCC GGGCTCCATC 

12351 TTGGCCTGCA TCAGGAAGTG CAGCACCCCG TCGATCTCCC GCGCCACGAT 

12401 CCCGAGCAGC CCCACCTCCG GCTGCACGAT GATGGGCTGC GTCCAGCCCC 

12451 GCTCGGGCAG CCGGTCCGTA CGGACGTGCA GCCCCTCCAC GGAGAAGAAA

12501 CGGCCCGACG CGTGGTGCAG GTTTCCCGTA CCCGGGTGGA AGCTCCAGCC 

12551 GCGCAGCTCC GCGAAGGGAA CGCGGGACAC GTCGAAGCGC CCCGCCCGCA 

12601 GGCGTTCGGC CAGCCAGCCG GAGATGCCGT CGAACGGCGT GACCGCACTG 
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12651 TCCGCGGTGC GTGCCGACAC CAGCACCCGC CGCGCCGTGT CCACCGGGTC 
12701 ACCGGGCCGG ACCGCGTCCG CACGGCGCCG CGCGGCGCCG TGCGGGGCGG 
12751 GGGCGGATCG CGGCGGTACG GGTTCGCGGG CGGTGTCCGC GGCGGTGCGC
12801 GGCGGGACGG GGCCGGTGCT CGTGTCCGCG GCGGTACGCG GTGGGACGGT 
12851 CCCGGTGGCC GTGTCCGCGG TGGCCGTGCC GGCGAGGGCG TCGCCGATGG 
12901 TCCGGCACAC CTCGTCCATC CGGTCGTTCA GATAGAAGTG ACCGCCGGCG 
12951 AAGGTGTGCA GGGCGAAGGG GCCCGTGGTC AGCTCCCGCC AGGCCCTCGC 
13001 CTCCTCCAGC GGGACATCGG GATCACGGTC ACCGGTGAGC ACCGTGACCG 
13051 GACAGTCCAG CGCACCGCCG GGCACATACG CGTACGTGCC CGCCGCCCGG 
13101 TAGTCGTTGC GGATCGCCGG CAGGGCCAGC CGCAGCAGCT CCTCGTCCTG 
13151 GAGGACGGCG TCCTCGGTGC CCTGAAGCGT GGCGATCTCC GCGATCAGCG 
13201 CGTCGTCGTC GAGGAGGTGG GCGACGTCCC GCCGGCGCAC CGTCGGCGCA 
13251 CGGCGGCCCG ACACCAGCAG ATGGACGGGG GAGGCCTGCC CGGAACCGCG 
13301 CAGCCGGCGC GCGACCTCGA ACGCCACCGT GGCACCCATG CTGTGCCCGA 
13351 ACAGCGCGAG CGGACGGTCG GCCCAGCGCA GGATCTCCGG CACCACCTGG 
13401 TCCACCAGGC CCGATATGGA CGGGATGAAC GGCTCGTGCC GGCGGTCCTG 
13451 GCGGCCCGGG TACTGCACCG CCAGCGCCTC CACGGTCTCG TCCAGTCCGC 
13501 GTGCCAGGGC GGCGAAGGAG GTCGCGGCGC CACCGGCGTG CGGGAAGCAG 
13551 ACCAGACGCA GTTCCGGATC CCGCACCGGG CGGTAACGGC GGACCCACAG 
13601 ACCCTCGTCC GGGTGTCCGG CCGGCGACGG GGCTCCCGGA ACGGGTGGTG 
13651 CGGAAGGGGT GCTCACGGCG GATCCAGCTC CTCGCGGTCG GGGGGACCGC 
13701 TGTCGGGGAC GGCACGTCGG GTGCGGACGT CGGGTACGGG CGTCGGGGCG 
13751 TGACGGGGAG GGACGGGGCG GTCGGTCAGT CGGTGCGCCG GGCCTCCTGC
13801 GCGGCCTTCT TCAGCGGTTC CCACCACGCG CGGTTCTCCG CGTACCAGCG 
13851 CACCGTGTCC GCCAGGCCCG TCGTGAAGTC CGTACGCGGG GCATAGCCCA 
13901 GCTCGCCCGT GATCTTGCCG ATGTCCAGCG CGTACCGCAG GTCGTGCCCC 
13951 GGCCGGTCGG CGACGTGGCG CACCGACGAG GCGTCGGCAC CGCACAGCCC 
14001 GAGCAGCCGC TTCGTCAGCT CCCGGTTGGT CAGCTCCGTC CCGCCACCGA 
14051 TGTGGTAGAC CTCGCCCGGG CGCCCGCGGG TCGCCACCAG GCTGATCCCG
14101 CGGCAGTGGT CGTCCACGTG CAGCCAGTCC CGGCTGTTGC CGCCGTCGCT
14151 GTACAGCGGC ACCGTCAGAC CGTCCAACAG GTTCGTGGCG AAGAGCGGGA 
14201 CGACCTTCTC GGGGTGCTGG TACGGGCCGT AGTTGTTGGA GCACCGGGTG 
14251 ACGAC GACCG GCAGGCCGTA CGTCCGGTGG TAGGCCAGCG CCAGGAGGTC 
14301 CGACGCCGCC TTCGAGGCGG CGTACGGGGA GTTCGGCGCC AGCGGCTGCT 
14351 CCTCGCGCCA CGACCCCTCG GCGATCGAGC CGTACACCTC GTCCGTGGAG 
14401 ACGTGGACGA ACCGGCCGGC CCCCGCCTCC ACCGCGGCCT GCAAGAGGAC
14451 TTGCGTCCCC CGTACGTTCG TCTCGACGAA CGCCGACGCG TCGGCGATGG 
14501 AGCGGTCCAC GTGCGACTCC GCCGCGAAGT GGACCACGAC GTCCGCCCCC 
14551 CGCACGACCC GGGACATCAC CTCCGCGTCC CGGATGTCGG CGTGCACGAA 
14601 CTCCAGCGAC GGATGGTCCG CGACCGGGTC CAGGTTGGCG AGGTTCCCGG 
14651 CATAGGTCAG CTTGTCGACC ACCACCGTCC GCGCCCCGGC CAGGTCCGGA
14701 TACGCCCCGG CCAGCAGTTG TCTGACGAAG TGCGAGCCGA TGAAGCCCGC 
14751 ACCTCCGGTG ACCAGCAGCC GCATGGGAGC ACAGACCTTT CTTCCAGGGA 
14801 CGGGAAACGG GGAGGCGGAC GGGGACGGAG GCGAGGGCGG TGGCTATGCG 
14851 GCCGGTCCGG ACATGAGGGT CTCCGCCACG TCCATCAAGT ACCGGCCGTA 
14901 GCTGGAGCTC TCGAGTTCAC GGCCGAGCTC GTGGCACTGC CGCGCGCTGA
14951 TGTACCCCAT CCGCAGGGCG ATCTCCTCGA CGCAGGAGAT CCGCACGCCC 
15001 TGCCGCTGCT CCAGGAGCTG GACGTACTGC CCCGCTTGCA GCAGCGAGCT 
15051 GTGCGTGCCC ATGTCCAGCC AGGCGAACCC GCGCCCCAGT TCCGTCATAC 
15101 GGGCGCGGCC CTGCTCCAGG TACACCTTGT TGACGTCGGT GATCTCCAGC 
15151 TCGCCCCGCG GCGACGGTGT CAGCCGCCGG GCGATGTCCA CCACGCCGTT 
15201 GTCGTAGAAG TACAGCCCCG TCACCGCGAG ATGGGAGCGG GGCTTCTCCG 
15251 GCTTCTCCTC CAGGGACACC AGCCGGCCTT CCGCGTCGAC CTCGCCGACG 
15301 CCGTAGCGCC GGGGGTCCTT CACCGGGTAG CCGAACAGCT CGCAGCCGTC 
15351 CAGCCGCGCC GCGGTGGAGG CCAGCACGGA GGAGAACCCC GGACCGTGGA 
15401 AGACGTTGTC CCCCAGGATG AGGGCGACCG GGTCGTCCCC GATGTGCTCC 
15451 TCGCCGATGA GGAACGCCTC GGCGATGCCC CGGGGCTCCT CCTGCTCGGC
15501 GTAGCCGACA CTGATCCCGA TGCGGCTGCC GTCGCCCAGC AGCGAACGGA
15551 ACATCTCCAA GTGCGTCTTC GACGTGATGA TCTGGATGTC CCGGATCCCC
15601 GCCAGCATGA GCACCGACAG CGGGTAGTAG ATCATGGGCT TGTCGTAGAC
15651 CGGCAGCAAC TGCTTGGACA GTGCCCCGGT CAGGGGGCGC AGGCGCGTGC
15701 CGCTGCCGCC CGCCAGGATG ATGCCCTTCA TGGGCCGCCG GTCCGCCGTC
15751 GTCTTCGTCA T

59800 G

59801 TGAGCCCCGC ACCCGCCACC GAGGACCCGG CCGCCGCCGG GCGCCGCCTG 
59851 CAACTGACCC GCGCAGCCCA GTGGTTCGCG GGAACCCAGG ACGACCCGTA 
59901 CGCGCTCGTC CTGCGCGCCG AGGCCACCGA CCCGGCCCCG TACGAGGAGC 
59951 GGATCCGGGC CCACGGGCCG CTCTTCCGCA GCGACCTGCT CGACACCTGG 
60001 GTCACGGCGA GCAGGGCCGT CGCCGACGAA GTGATCACCT CACCCGCCTT 
60051 CGACGGGCTC ACGGCCGACG GGCGGCGCCC CGGCGCGCGG GAACTGCCGC 
60101 TGTCCGGCAC CGCGCTCGAC GCGGACCGCG CCACATGCGC ACGGTTCGGG 
60151 GCCCTCACCG CCTGGGGCGG GCCGCTGCTG CCGGCGCCGC ACGAGCGGGC 
60201 GCTGCGCGAG TCCGCCGAAC GGCGGGCCCA CACACTCCTC GACGGGGCGG 
60251 AGGCCGCCCT GGCCGCCGAC GGCACCGTCG ACCTCGTCGA CGCGTACGCC 
60301 CGCAGGCTCC CCGCGCTGGT CCTCCGCGAA CAGCTCGGCG TGCCGGAGGA 
60351 GGCGGCGACC GCCTTCGAGG ACGCGCTGGC CGGCTGCCGC CGCACCCTGG 
60401 ACGGCGCCCT GTGCCCGCAA CTCCTCCCGG ACGCCGTGGC GGGGGTGCGC 
60451 GCGGAAGCCG CGCTGACCGC CGTGCTGGCC TCCGCCCTGC GCGGGACTCC
60501 GGCCGGCCGG GCCCCCGACG CCGTCGCCGC CGCCCGCACC CTGGCCGTCG 
60551 CGGCCGCCGA GCCCGCAGCC ACCCTCGTCG GCAACGCCGT ACAGGAGCTG 
60601 CTGGCGCGTC CCGCGCAGTG GGCGGAGCTC GTACGCGACC CGCGCCTCGC 
60651 GGCCGCCGCG GTGACCGAAA CGCTGCGTGT CGCCCCGCCC GTCCGCCTGG 
60701 AGCGGCGGGT CGCCCGCGAG GACACGGACA TCGCCGGGCA GCGCCTCCCC 
60751 GCCGGGGGGA GCGTCGTGAT CCTCGTCGCC GCCGTCAACC GCGCGCCCGT
60801 ATCCGCGGGA AGCGACGCCT CCACCACCGT CCCGCACGCC GGCGGCCGGC 
60851 CCCGTACCTC CGCCCCCTCC GTCCCCTCAG CCCCCTTCGA CCTCACACGG 
60901 CCCGTGGCCG CGCCCGGGCC GTTCGGGCTC CCCGGCGACC TGCACTTCCG 
60951 CCTCGGCGGG CCCCTGGTCG GAACGGTCGC CGAAGCCGCG CTCGGTGCGC 
61001 TGGCCGCACG GCTCCCCGGT CTGCGCGCCG CCGGGCCGGC CGTGCGGCGC 
61051 CGCCGCTCAC CGGTGCTGCA CGGACACGCC CGCCTCCCCG TCGCCGTCGC
61101 CCGGACGGCC CGTGACCTGC CCGCCACCGC ACCGCGGAAC TGAGGAGGGA
61151 GTGCCCCGAT GCGTATCCTG CTGACGTCGT TCGCGCACAA CACGCACTAC 
61201 TACAACCTGG TCCCCCTCGG CTGGGCGCTG CGCGCCGCCG GGCACGACGT 
61251 ACGGGTCGCC AGCCAGCCCT CGCTGACCGG CACCATCACC GGCTCCGGGC 
61301 TGACCGCCGT CCCCGTGGGC GACGACACGG CCATCGTCGA GCTGATCACC 
61351 GAGATCGGCG ACGACCTCGT CCTCTACCAG CAGGGCATGG ACTTCGTGGA 
61401 CACCCGCGAC GAGCCGCTGT CCTGGGAACA CGCCCTCGGA CAGCAGACGA 
61451 TCATGTCGGC CATGTGCTTC TCGCCGCTGA ACGGCGACAG CACCATCGAC 
61501 GACATGGTGG CGCTGGCCCG TTCCTGGAAA CCGGACCTCG TCCTGTGGGA 
61551 GCCCTTCACC TACGCGGGAC CCGTCGCCGC GCACGCCTGC GGCGCCGCCC 
61601 ACGCCCGGCT GCTGTGGGGT CCCGACGTGG TCCTCAACGC ACGGCGGCAG 
61651 TTCACCCGGC TGCTCGCCGA GCGCCCCGTC GAACAGCGCG AGGACCCGGT 
61701 CGGCGAATGG CTCACGTGGA CGCTGGAGCG CCACGGCCTC GCCGCCGACG 
61751 CGGACACGAT CGAGGAACTG TTCGCCGGGC AGTGGAC GAT CGACCCCAGC 
61801 GCCGGGAGCC TGCGGCTGCC GGTCGACGGC GAGGTCGTGC CCATGCGCTT 
61851 CGTGCCGTAC AACGGCGCCT CGGTCGTCCC CGCCTGGCTC TCCGAGCCGC 
61901 CTGCCCGGCC CCGGGTCTGC GTCACCCTCG GCGTCTCCAC CCGGGAGACC 
61951 TACGGCACGG ACGGCGTCCC GTTCCACGAA CTGCTGGCCG GACTGGCCGA 
62001 CGTGGACGCC GAGATCGTCG CCACCCTCGA CGCGGGGCAG CTCCCGGACG 
62051 CCGCCGGTCT GCCCGGCAAT GTGCGCGTCG TCGACTTCGT GCCGCTGGAC 
62101 GCCCTGCTGC CGAGCTGCGC CGCGATCGTC CACCACGGAG GCGCGGGAAC 
62151 CTGTTTCACG GCCACCGTGC ACGGCGTCCC GCAGATCGTC GTGGCCTCCC 
62201 TCTGGGACGC GCCGCTGAAG GCGCACCAAC TCGCCGAGGC GGGCGCCGGG 
62251 ATCGCCCTGG ACCCCGGGGA ACTGGGCGTG GACACCCTGC GCGGCGCCGT 
62301 CGTGCGGGTG CTGGAGAGCC GCGAGATGGC CGTGGCGGCG CGTCGCCTCG 
62351 CCGACGAGAT GCTCGCCGCC CCCACCCCGG CCGCGCTCGT CCCCCGCCTC 
62401 GAACGCCTCA CCGCCGCGCA CCGCCGCGCC TGATCCCGCC AAGGAGCCCC
62451 CATGAACCTC GAATACAGCG GCGACATCGC CCGGTTGTAC GACCTGGTCC
62501 ACCAGGGAAA GGGCAAGGAC TACCGGGCGG AGGCCGAGGA GCTGGCCGCG
62551 CTTGTCACCC AGCGCCGCCC CGGGGCCCGC TCCCTCCTCG ACGTGGCCTG
62601 CGGAACGGGG ATGCACCTGC GGCACCTCGG CGACCTCTTC GAGGAGGTGG
62651 CCGGGGTGGA GATGTCCCCC GACATGCTGG CCATCGCGCA GCGGCGCAAC
62701 CCGGAGGCCG GCATCCACCG GGGGGACATG CGGGACTTCG CCCTCGGCCG
62751 CCGCTTCGAC GCCGTGATCT GCATGTTCAG TTCCATCGGG CACATGCGCG
62801 ACCAGCGGGA ACTGGACGCG GCGATCGGCC GGTTCGCCGC GCACCTGCCG
62851 TCCGGCGGGG TCGTGATCGT CGATCCCTGG TGGTTCCCGG AGACGTTCAC
62901 ACCGGGGTAC GTCGGCGCGA GCCTCGTCGA GGCCGAGGGC CGCACCATCG
62951 CGCGCTTCTC CCACTCCGCG CTCGAGGACG GCGCGACCCG GATCGATGTG
63001 GACTACCTCG TCGGCGTGCC GGGGGAGGGG GTGCGGCACT TGAAGGAGAC
63051 CCATCGGATC ACGCTTTTCG GGCGTGCGCA GTACGAGGCG GCCTTCACCG
63101 CGGCGGGGAT GTCCGTCGAG TACCTCCCGC ACGCCGCCAC CGACCGCGGA
63151 CTCTTCGTCG GCGTCCAGGC CTGA



Figure 10 

1 MKGIILAGGS GTRLRPLTGA LSKQLLPVYD KPMIYYPLSV LMLAGIRDIQ 
51 IITSKTHLEM FRSLLGDGSR IGISVGYAEQ EEPRGIAEAF LIGEEHIGDD 
101 PVALILGDNV FHGPGFSSVL ASTAARLDGC ELFGYPVKDP RRYGVGEVDA 
151 EGRLVSLEEK PEKPRSHLAV TGLYFYDNGV VDIARRLTPS PRGELEITDV 
201 NKVYLEQGRA RMTELGRGFA WLDMGTHSSL LQAGQYVQLL EQRQGVRISC 
251 VEEIALRMGY ISARQCHELG RELESSSYGR YLMDVAETLM SGPAA

Figure 11

1 MRLLVTGGAG FIGSHFVRQL LAGAYPDLAG ARTVWDKLT YAGNLANLDP 
51 VADHPSLEFV HADIRDAEVM SRWRGADW VHFAAESHVD RSIADASAFV
101 ETNVRGTQVL LQAAVEAGAG RFVHVSTDEV YGSIAEGSWR EEQPLAPNSP 
151 YAASKAASDL LAIiAYHRTYG LPVWTRCSN NYGPYQHPEK WPLFATNLL 
201 DGLTVPLYSD GGNSRDWLHV DDHCRGISLV ATRGRPGEVY HIGGGTELTN 
251 RELTKRLLGL CGADASSVRH VADRPGHDLR YALDIGKITG ELGYAPRTDF 
301 TTGLADTVRW YAENRAWWEP LKKAAQEARR TD

1 VSTPSAPPVP GAPSPAGHPD EGLWVRRYRP VRDPELRLVC FPHAGGAATS
51 FAALARGLDE TVEALAVQYP GRQDRRHEPF IPSISGLVDQ WPEILRWAD
101 RPLALFGHSM GATVAFEVAR RLRGSGQASP VHLLVSGRRA PTVRRRDVAH
151 LLDDDALIAE IATLQGTEDA VLQDEELLRL ALPAIRNDYR AAGTYAYVPG
201 GALDCPVTVL TGDRDPDVPL EEARAWRELT TGPFALHTFA GGHFYLNDRM
251 DEVCRTIGDA LAGTATADTA TGTVPPRTAA DTSTGPVPPR TAADTAREPV
301 PPRSAPAPHG AARRRADAVR PGDPVDTARR VLVSARTADS AVTPFDGISG
351 WLAERLRAGR FDVSRVPFAE LRGWSFHPGT GNLHHASGRF FSVEGLHVRT
401 DRLPERGWTQ PIIVQPEVGL LGIVAREIDG VLHFLMQAKM EPGNVNVLQV
451 SPTVQATRSN FTGVHRGRDI RYLDLFMGPR RARVLVDSIQ SEQADWFLAK
501 RNRNMIVELA ADDDLDIGED FRWLTLGQLR RLLMLDNWN MDARSILACL
551 PTADADASAP SPVLRSFFGS PGAARHTTAE VLTWFTGVRA LRELVQNRVP
601 LDTVTADGWY RTPHEIAHES GRHFRVMAAE VSASSREVTS WTQPLIEPRL
651 PGLMALLVKS VDGVLHALVR ARVDVGHLNV AELAPTVQCR PQEHTGPRGL
701 PGPPYLEDVL SAPPQDVRYD AVQSEEGGRF FHAQNRYVIV EVPHDFPEDA
751 PDDFAWLSLG QLTGLLAHGN YLNIELRTLV ACAHTLY

Figure 13

1 MVNDPMPRGS GSGSVWLGG AGYVGRHVCA AFAARGRDW WGRRPPEEP 
51 MPYRCVTLDL AGTDPAALAA ALDAERPDTI VNSVGSIWGR TDEQMWSATA 
101 VPTLRLLEAL ALMSARPRLV HLGSVLEYGP VTPGGSVGAD AVPRPDTAYG 
151 RSKLAASEAV LRGTSGGWVD GWLRVSNVS GPGTPRISLL GQVAERLLAA 
201 AGTGAEAWE LSRLRAHRDY VDVRDVADAV VAAARAPAVP VAVGIGRGEA
251 VAVRDLVGLF IEASGIPARV VERPAPGRAP GHREDWLRVD TGAARALLGW
301 APRRSLRESV RDCWHDLVRA HRLPTTPSKH SGG

Figure 14

1 VTTYVWDYLA EYQNERADLL DAVETVFASG QLVLGPSVDG FEKEFADYHG 
51 LRHCGGVDNG TNAVKLGLQA LGVGPGDEW TVSNTAAPTV VAIDGTGATP
101 VFVDVRAEDH LMDTDQVADV ITPRTKALLP VHLYGQCVDM APLRALAEQH 
151 GLWLEDCAQ AHGARHHGEL AGTLGDAAAF SFYPTKVLGA YGDGGAVLTD 
201 DADVDRALRR LRYYGMEDVY YWQTPGHNS RLDEVQAEIL RRKLTRLDRY 
251 IEGRRAVARR YAEGLANLTG PGGLVLPSVT EGNDHVYYVY WRHPRRDDI 
301 IEALKSYGIS LNISYPWPVH TMTGFAHLGY AKGSLPVTER LADEIFSLPM 
351 YPGLAPDVQD KVTAALHEVL ATL

Figure 15



1 VSPAPATEDP AAAGRRLQLT RAAQWFAGTQ DDPYALVLRA EATDPAPYEE 

51 RIRAHGPLFR SDLLDTWVTA SRAVADEVIT SPAFDGLTAD GRRPGARELP 

101 LSGTALDADR ATCARFGALT AWGGPLLPAP HERALRESAE RRAHTLLDGA 

151 EAALAADGTV DLVDAYARRL PALVLREQLG VPEEAATAFE DALAGCRRTL 

201 DGALCPQLLP DAVAGVRAEA ALTAVLASAL RGTPAGRAPD AVAAARTLAV 

251 AAAEPAATLV GNAVQELLAR PAQWAELVRD PRLAAAAVTE TLRVAPPVRL

301 ERRVAREDTD IAGQRLPAGG SVVILVAAVN RAPVSAGSDA STTVPHAGGR

351 PRTSAPSVPS APFDLTRPVA APGPFGLPGD LHFRLGGPLV GTVAEAALGA 

401 LAARLPGLRA AGPAVRRRRS PVLHGHARLP VAVARTARDL PATAPRN

Figure 16
Figure 17

1 MNLEYSGDIA RLYDLVHQGK GKDYRAEAEE LAALVTQRRP GARSLLDVAC
51 GTGMHLRHLG DLFEEVAGVE MSPDMLAIAQ RRNPEAGIHR GDMRDFALGR
101 RFDAVICMFS SIGHMRDQRE LDAAIGRFAA HLPSGGVVIV DPWWFPETFT
151 PGYVGASLVE AEGRTIARFS HSALEDGATR IDVDYLVGVP GEGVRHLKET
201 HRITLFGRAQ YEAAFTAAGM SVEYLPHAAT DRGLFVGVQA